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SCARECROW GENE, PROMOTER AND USES THEREOF 

This application is a continuation-in-part of co- 
pending Application No. 08/842,445, filed April 24, 1997, 
5 which is a continuation-in-part of Application No. 
08/638,617, filed April 26, 1996, now abandoned, the 
disclosures of which are herein incorporated by reference in 
their entirety. 

.0 This invention was made with government support 

under grant number-. GM4 377 8 awarded by the National 
Institute of Health. The government may have certain rights 
in the invention. 

.5 1 . INTRODUCTION 

The present invention generally relates to the 
SCARECROW (SCR) gene family and their promoters. The 
invention more particularly relates to ectopic expression of 
members of the SCARECROW gene family in transgenic plants to 

0 artificially modify plant structures. The invention also 
relates to utilization of the SCARECROW promoter for tissue 
and organ specific expression of heterologous gene products. 

2 . BACKGROUND OF THE INVENTION 
5 Asymmetric cell divisions, in which a cell divides 

to give two daughters with different fates, play an important 
role in the development of all multicellular organisms. In 
plants, because there is no cell migration, the regulation of 
asymmetric cell divisions is of heightened importance in 
0 determining organ morphology. In contrast to animal 
embryogenesis, most plant organs are not formed during 
embryogenesis . Rather, cells that form the apical meristems 
are set aside at the shoot and root poles. These reservoirs 
of stem cells are considered to be the source of all post- 
5 embryonic organ development in plants. A fundamental 

question in developmental biology is how meristems function 
to generate plant organs. 
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2.1. ROOT DEVELOPMENT 

Root organization is established during 
embryogenesis . This organization is propagated during 
post embryonic development by the root raeristem. Following 
5 germination, the development of the post embryonic root is a 
continuous process, wherein a series of initials or stem 
cells continuously divide to perpetuate the pattern 
established in the embryonic root (Steeves & Sussex, 1972, 
Patterns in Plant Development . Englewood Cliffs, NJ: 
10 Prentice-Hall, Inc.). 

2.1.1. ARABIDOPSIS ROOT DEVELOPMENT 

Due to the organization of the Arabidopsis root, .it 
is possible to follow the fate of cells from the meristem to 

15 maturity and identify the progenitors of each cell type 

(Dolan et al . , 1993, Development 119:71-84). The Arabidopsis 
root is a relatively simple and well characterized organ. 
The radial organization of the mature tissues in the 
Arabidopsis root has been likened to tree rings with the 

20 epidermis, cortex, endodermis and pericycle forming radially 
symmetric cell layers that surround the vascular cylinder 
(FIG. 1A) . See also Dolan et al . , 1993, Development 
119:71-84. These mature tissues are derived from four sets 
of stem cells or initials: i) the columella root cap initial; 

25 ii) the pericycle/vascular initial; iii) the 

epidermal /lateral root cap initial; and iv) the 
cortex/endodermal initial (Dolan et al . , 1993, Development 
119:71-84) . It has been shown that these initials undergo 
asymmetric divisions (Scheres et al . , 1995, Development 

30 121:53-62). The cortex/endodermal initial, for example, 
first divides anticlinally (in a transverse orientation) 
(FIG. IB) . This asymmetric division produces another initial 
and a daughter cell. The daughter cell, in turn, expands and 
then divides periclinally (in the longitudinal orientation) 

35 (FIG. IB) . This second asymmetric division produces the 
progenitors of the endodermis and the cortex cell lineages 
(FIG. IB) . 

- 2 - 
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Furthermore, root radial organization in 
Arabidopsis is produced by three distinct developmental 
strategies. First, primary roots employ stem cells, wherein 
initials undergo asymmetric divisions first to regenerate 
5 themselves and then to generate the cell lineages of the root 
(Fig. IB). Second, in the embryo, sequential asymmetric 
divisions subdivide pre-existing tissue to form the cell 
layers of the embryonic root. Finally, lateral roots are 
formed by a strategy of cell proliferation that originates in 
10 differentiated tissues. Remarkably, within a given species, 
all three strategies result in roots with a nearly identical 
radial organization. 

2.1.2. MAIZE ROOT DEVELOPMENT 

15 The root organization of Zea mays (maize) , which is 

a very well characterized member of the grass family, is far 
more complex than the root organization in Arabidopsis. The 
root system of maize consists of primary, embryonic, lateral, 
seminal lateral and adventitious roots. Both primary and 

20 seminal lateral roots are formed during ' embryogenesis , 

wherein the primary root is the first root to emerge during 
germination, followed by the seminal lateral roots formed at 
the scutellar nodal region (Freeling, M. and Walbot, V. 
(1994) , The Maize Handbook, (New York: Springer -Verlag) ; 

25 Hetz, W. et al . , (1996), Plant J. 10:845-857). Both crown 
and prop roots which develop post-embryonically are shoot- 
borne roots, often termed "adventitious". However, since 
these roots are part of the normal development of the plant, 
they are not, strictly speaking, adventitious roots, which 

3 0 are typically formed as a result of injury or hormone 

treatment. Crown roots, representing the major roots of the 
mature plant, are formed at consecutive early nodes of the 
stem beginning with the coleoptilar nodes. Later in 
development, brace or prop roots emerge from nodes above the 

35 soil level (Freeling, M . and Walbot, V. (1994) , The Maize 
Handbook, (New York: Springer- Verlag) ; Hetz, W. et al . , 
(1996), Plant J. 10:845-857). 
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Currently, there are two notably different types of 
organization of root apical meristems: an open and a closed 
meristem. In an "open" meristem, the cell files of the 
mature tissues cannot be traced with much confidence to 
5 distinct initials, and the incipient tissues do not appear to 
have discrete boundary walls between the root proper and the 
root cap (Clowes, F. A. L., 1981, Ann. Bot. 48:761-767). 
Therefore, the interpretation of the organization of the open 
meristem has been problematic (Clowes, F. A. L., 1981, Ann. 
10 Bot. 48:761-767). In a "closed" meristem, however, since 
files of cells converge onto a pole at the root apex, it is 
easy to identify discrete layers in median longitudinal 
sections (Clowes, F. A. L., 1981, Ann. Bot. 48:761-767). 

15 characteristics of the closed meristem (FIG. 23) - However, 
there are important differences. In maize roots, the root 
apical meristem consists of three independent layers of 

to epidermis, cortex and endodermis and the third generates 
20 the root cap, whereas in the Arabidopsis root apical 

lateral root cap (Esau, K., 1977, Anatomy of Seed Plants. 2nd 
ed. (New York: John Wiley & Sons); Esau, K. , 1953, Plant 
Anatomy. (New York: John Wiley & Sons)). 

25 Primary organization of the root apical meristem in 

maize occurs during embryogenesis, (Steeves, T. A. and 
Sussex, I. ft., (1989), Patterns in plant development., 2nd 
ed. (Cambridge University Press)) as in Arabidopsis. There 
are three main phases in embryo development in maize 

30 (FIG. 24) (Freeling, M. and Walbot, V. (1994), The Maize 
Handbook, (New York: Springer-Verlag) ; Steeves, T. A. and 
Sussex, I. M., (1989), Patterns in plant development., 2nd 
ed., (Cambridge University Press); Sheridan, W. F. and Clark, 
J. K. , (1993), Plant J. 3:347-358). As m Arabidopsis, the 

35 very first division of the zygote establishes the initial 
asymmetry of the embryo (FIG. 24A) . However, unlike 
Arao-.dopsis, o-irryor.-c development in maize is characterized 
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by rather irregular cell divisions (Sheridan, W. F. and 
Clark, J. K. , (1993), Plant J. 3:347-358). During the first 
phase, the apical -basal asymmetry of the embryo is 
established, and the embryo is regionalized into suspensor 
5 and embryo proper (FIG. 24B-C) . During the second phase, 

radial asymmetry appears and the embryonic axis and meristems 
are established (FIG. 24D-E) (Clowes, F. A. L . , (1978), New 
Phytol . 80:409-419). Finally, during the third phase, 
vegetative structures such as embryonic roots and leaves are 
10 elaborated (FIG. 24F-G) (Sheridan, W. F. and Clark, J. K. , 
(1993), Plant J. 3:347-358). 

2.1.3. THE QUIESCENT CENTER 

The quiescent center (QC) of root apical meristems 

15 of angiosperms is a population of mitotically inactive cells. 
In the QC of the primary root of maize, for example, the 
average duration of a mitotic cycle is about 200 hours 
compared with only 12 hours in the cells just below the QC 
and 2 8 hours in the cells just above the QC (Clowes, F. A. 

20 L . , (1961), J. Exp. Bot . 9:229-238). Moreover, there are 

also reductions in the rates of synthesis of DNA and protein, 
and corresponding reductions in the amounts of DNA and RNA 
per cell (Clowes, F. A. L . , (1956), New Phytol. 55:29-34). 

Although the precise role of the QC has remained 

25 speculative, it is generally accepted that cells within the 
QC are undifferentiated and, other than the anatomical 
pattern of cell files, lacking in radial pattern information. 
This theory has been supported by ablation studies performed 
in Arabidopsis, wherein, complete laser ablation of the four 

30 central cells in the Arabidopsis QC led to subsequent 

restoration of the QC by cells of the stele. Furthermore, 
laser ablation of only one or two cells in the QC resulted in 
differentiation of surrounding initial cells. Analysis of 
the hobbit mutants further supports these observations. In 

35 the hobbit mutants, there is no functional QC, leading all 
cortex initials to divide into cortex and endodermis during 
embryogenesis (van den Berg, C, et al . , (1995), Nature 

- 5 - 
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378:62-65). Taken together, it is suggested that the QC 
suppresses differentiation of surrounding initials in the 
range of a single cell (van den Berg, C, et al., (1995), 
Nature 378 : 62-65) . 
5 In maize, on which the contemporary view of the 

role of the QC is based (Feldman, L. J., (1984), Amer. J. 
Bot. 71:1308-1314; Freeling, M. and Walbot, V., (1994), The 
maize handbook. (New York: Springer-Verlag) ) , surgical and 
tissue culture systems were developed to study the 

10 organization process of root apical meristems (Feldman, L. 

J., (1976), Planta 128:207-212). Following removal of the QC, 
the remaining root regenerates a new root tip. This process 
appears to involve de novo organization of the QC and the , 
apical meristem (Feldman, L. J., (1976), Planta 128:207-212). 

15 In addition, the excised QC itself is capable of generating a 
new root (Feldman, L. J. and Torrey, J. G., (1976), Amer. J. 
Bot. 63:345-355). This suggests that there is indeed 
sufficient radial pattern information in the QC to allow the 
regeneration of more or less intact roots. 

20 

2.2. GENES REGULATING ROOT STRUCTURE 
Mutations that disrupt the asymmetric divisions of 
the cortex/endodermal initial have been identified and 
characterized (Benfey et al . , 19 93, Development 119:57-70; 

25 Scheres et al . , 1995, Development 121:53-62). short-root 
(shr) and scarecrow (scr) mutants are missing a cell layer 
between the epidermis and the pericycle. In both types of 
mutants, the cortex/endodermal initial divides anticlinally, 
but the subsequent periclinal division that increases the 

30 number of cell layers does not take place (Benfey et al . , 
1993, Development 119:57-70; Scheres et al . , 1995, 
Development 121:53-62). The defect is first apparent in the 
embryo and it extends throughout the entire embryonic axis, 
which includes the embryonic root and hypocotyl (Scheres et 

35 al . , 1995, Development 121:53-62). This is true also for 
other radial organization mutants characterized to date, 
suggesting that radial patterning that occurs during 

- 6 - 
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embryonic development may influence the post -embryonic 
pattern generated by the meristematic initials (Scheres et 
al . , 1995, Development 121:53-62). 

Characterization of the mutant cell layer in shr 
5 indicated that two endodermal -specif ic markers were absent 
(Benfey et al., 1993, Development 119:57-70). This provided 
evidence that the wild-type SHR gene may be involved in the 
specification of endodermis identity. 

10 2.3. GEOTROPISM 

In plants, the capacity for gravitropism has been 
correlated with the presence of amyloplast sedimentation. 
See, e.g., Volkmann and Sievers, 1979, Encyclopedia Plant . 
Physiol., N.S. vol 7, pp. 573-600; Sack, 1991, Intern. Rev. 

15 Cytol. 127:193-252; Bjorkmann, 1992, Adv. Space Res. 12:195- 
201; Poff et al . , in The Physiology of Tropisms , Meyerowitz & 
Somerville (eds) ; Cold Spring Harbor Laboratory Press, 
Plainview, NY (1994) pp. 639-664; Barlow, 1995, Plant Cell 
Environ. 18:951-962. Amyloplast sedimentation only occurs in 

20 cells in specific locations at distinct developmental stages. 
That is, when and where sedimentation occurs is precisely 
regulated (Sack, 1991, Intern. Rev. Cytol. 12 7:193-252). In 
roots, amyloplast sedimentation only occurs in the central 
(columella) cells of the rootcap; as these cells mature into 

25 peripheral cap cells, the amyloplasts no longer sediment 
(Sack Sc. Kiss, 1989, Amer. J. Bot . 76:454-464; Sievers & 
Braun, in The Root Cap: Structure and Function , Wassail et 
al . (eds.), New York: M. Dekker (1996) pp. 31-49). In stems 
of many plants, including Arabidopsis, amyloplast 

30 sedimentation occurs in the starch sheath (endodermis) 

especially in elongating regions of the stem (von Guttenberg, 
Die Physiologischen Scheiden , Handbuch der Pf lanzenanatomie ; 
K. Linsbauer (ed.), Berlin: Gebruder Borncraeger, vol. 5 
(1943) p. 217; Sack, 1987, Can. J. Bot. 65:1514-1519; Sack, 

35 1991, Intern. Rev. Cytol. 127:193-252; Caspar & Pickard, 
1989, Planta 177:185-197; Volkmann et al . , 1993, J. PI. 
Physiol . 142 :710-6) . 

- 7 - 
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Gravitropic mutants have been studied for evidence 
that proves the role of amyloplast sedimentation in gravity- 
sensing. However, many gravitropic mutations affect 
downstream events such as auxin sensitivity or metabolism 
5 (Masson, 1995, BioEssays 17:119-127). Other mutations seem 
to affect gene products that process information from gravity 
sensing. For example, the lazy mutants of higher plants and 
comparable mutants in mosses can clearly sense and respond to 
gravity, but the mutations reverse the normal polarity of the 

10 gravitropic response (Gaiser & Lomax, 1993, Plant Physiol. 
102:339-344; Jenkins et al . , 1986, Plant Cell Environ 9:637- 
644) . Other mutations appear to affect gravitropism of 
specific organs. For example, sgr mutants have defective • 
shoot gravitropism (Fukaki et al . , 1996, Plant Physiol. 

15 110:933-943; Fukaki et al . , 1996, Plant Physiol. 110:945-955; 
Fukaki et al . , 1996, Plant Res. 109:129-137). 

Citation or identification of any reference herein 
shall not be construed as an admission that such reference is 
available as prior art to the present invention. 

20 

3 . SUMMARY OF THE INVENTION 
The structure and function of a regulatory gene, 
SCARECROW (SCR) , is described. The SCR gene is expressed 
specifically in root progenitor tissues of embryos, and in 

25 certain tissues of roots and stems. SCR expression controls 
cell division of certain cell types in roots, and affects the 
organization of root and stem. The present invention relates 
to the SCARECROW (SCR) gene (which encompasses the 
Arabidopsis SCR gene and its orthologs and paralogs) , SCR- 

30 like genes, SCR gene products, (including, but not limited 
to, transcriptional products such as mRNAs, antisense and 
ribozyme molecules, and translational products such as the 
SCR protein, polypeptides, peptides and fusion proteins 
related thereto) , antibodies to SCR gene products, SCR 

35 regulatory regions and the use of the foregoing to improve 
agronomically valuable plants. 

- 8 - 
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The invention is based, in part, on the discovery, 
identification and cloning of the gene responsible for the 
scarecrow phenotype . In contrast to the prevailing view that 
the SCR gene was likely to be involved in the specification 
5 of endodermis, the inventors have determined that the mutant 
cell layer in roots of scr mutants has differentiated 
characteristics of both cortex and endodermis. This is 
consistent with a role for SCR in the regulation of 
asymmetric cell division rather than in specification of the 

10 identity of either cortex or endodermis. The inventors have 
determined also that SCR expression affects the gravitropism 
of plant aerial structures such as the stem. 

One aspect of the invention relates to the 
heterologous expression of SCR genes and related nucleotide 

15 sequences, and specifically the Arabidopsis SCR and maize 
ZCARECROW (ZCR) genes, in stably transformed higher plant 
species. Modulation of SCR and ZCR expression levels can be 
used to advantageously modify root and aerial structures of 
transgenic plants and enhance the agronomic properties of 

20 such plants. 

Another aspect of the invention relates to the use 
of promoters of SCR genes, and specifically the use of the 
Arabidopsis SCR and maize ZCR promoters to control the 
expression of protein and RNA products in plants. Plant SCR 

25 promoters have a variety of uses, including, but not limited 
to, expressing heterologous genes in the embryo, root, root 
nodule and stem of transformed plants. 

The invention is illustrated by working examples, 
described infra, which demonstrate the isolation of the 

30 Arabidopsis SCR gene using insertion mutagenesis. More 

specifically, T-DNA tagging of genomic and cDNA clones of the 
Arabidopsis SCR gene are described. Other working examples 
include the isolation of SCR sequences from plant genomes 
using PCR amplification in combination with screening of 

35 genomic libraries, and heterologous gene expression in 

transgenic plants using SCR promoter expression constructs. 
Additional working examples describe the cloning and 

- 9 - 
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isolation of maize ZCR genes using probes derived from the 
Arabidopsis SCR gene on a maize genomic library. Still other 
working examples describe the characterization of the maize 
ZCR expression pattern in primary and embryonic roots, and 
5 during regeneration of the root tip following excision of the 
QC. 

Structural analysis of the deduced amino acid 
sequence of Arabidopsis SCR protein indicates that SCR 
encodes a transcription factor. Northern analysis, in situ 

10 hybridization analysis and enhancer trap analysis show highly 
localized expression of Arabidopsis SCR and maize ZCR in 
embryos and roots. Genetic analysis shows SCR expression 
also affects gravitropism of aerial structures [e.g. , stems 
and shoots) . This indicates that SCR is also expressed in 

15 those structures. 

Computer analysis of the deduced amino acid 
sequence of Arabidopsis SCR protein with those of Expressed 
Sequence Tag (EST) sequences and genomic sequences in GenBank 
reveals the existence of at least eighteen SCR genes in 

20 Arabidopsis, one SCR gene in maize, four SCR genes in rice, 
and one SCR gene in Brassica. A further aspect of the 
invention relates to the use of such EST sequences to obtain 
larger and/or complete clones of the corresponding SCR gene. 

The various embodiments of the claimed invention 

25 presented herein are by way of illustration only and are in 
no manner intended to limit the scope of the invention. 

3.1. DEFINITIONS 

As used herein, the terms listed below will have 
30 the meanings indicated. 

35S = cauliflower mosaic virus promoter for the 3 5S 

transcript 

cDNA = complementary DNA 

35 cis-regulatory 

element = A promoter sequence 5 ' upstream of the TATA 

box that confers specific regulatory response 
to a promoter containing such an element. A 

- 10 - 
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coding 
sequence 



DNA 
EST 

functional 



promoter may contain one or more cis- 
regulatory elements, each responsible for 
particular regulatory response 



sequence that encodes a complete or partial 
gene product {e.g., a complete protein or a 
fragment thereof) 



deoxyribonucleic acid 
expressed sequence tag 



a functional portion of a promoter is any 
portion of a promoter that is capable of 
causing transcription of a linked gene 
sequence, e.g., a truncated promoter 



gene 
product 



gene 

sequence 



GUS 
gDNA 

heterologous 
gene 



homologous 
promoter 



a gene construct comprising a promoter 
operably linked to a heterologous gene, 
wherein said promoter controls the 
transcription of the heterologous gene 



the RNA or protein encoded by a gene sequence 



sequence that encodes a complete gene product 
{e.g., a complete protein) 



1,3- 3 -Glucuronidase 
genomic DNA 



In the context of gene constructs, a 
heterologous gene means that the gene is 
linked to a promoter that said gene is not 
naturally linked to. The heterologous gene 
may or may not be from the organism 
contributing said promoter. The heterologous 
gene may encode messenger RNA (mRNA) , 
antisense RNA or ribozymes 



a native promoter of a gene that selectively 
hybridizes to the sequence of a SCR gene 
described herein 



essenger RNA 



operably 



WO 00/53723 



PCT/USOO/05875 



linked = A linkage between a promoter and gene sequence 
such that the transcription of said gene 
sequence is controlled by said promoter 

ortholog = related gene in a different plant (e.g., maize 
ZCARECROW gene is an ortholog of the 
Arabidopsis SCR gene) 

paralog = related gene in the same plant {e.g., 

Arabidopsis SCLal is a paralog of Arabidopsis 
SCR gene) 

RNA = ribonucleic acid 

RNase = ribonuclease 

SCR = SCARECROW gene or portion thereof, encompasses 

(italic) SCR and ZCR genes and their orthologs and 
paralogs 

SCR = SCARECROW protein 

scr = scarecrow mutant (e.g., scrl) 



SCL 
ZCR 



SCARECROW-likc gene 



maize ZCARECROW gene, an. ortholog of, for 
example, the Arabidopsis SCR gene 

SCR protein means a protein containing sequences or 

a domain substantially similar to one or more motifs (i.e., 

Motifs I-VI) , preferably MOTIF III (VHIID) , of the 

Arabidopsis SCR protein as shown in FIGS. 13A-F and FIGS. 

15A-S. SCR proteins include SCR ortholog and paralog 

proteins having the structure and activities described 

herein . 

SCR polypeptides and peptides include deleted or 
truncated forms of the SCR protein, and fragments 
corresponding to the SCR motifs described herein. 

SCR fusion proteins encompass proteins in which the 
SCR protein or an SCR polypeptide or peptide is fused to a 
heterologous protein, polypeptide or peptide. 

SCR gene, nucleotides or coding sequences mean 
nucleotides, e.g., gDNA or cDNA encoding SCR protein, SCR 
polypeptides, peptides or SCR fusion proteins. 
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SCR gene products include transcriptional products 
such, as mRNAs, antisense and ribozyrae molecules, as well as 
translational products of the SCR nucleotides described 
herein, including, but not limited to, the SCR protein, 
5 polypeptides, peptides and/or SCR fusion proteins. 

SCR promoter means the regulatory region native to 
the SCR gene in a variety of species, which promotes the 
organ and tissue specific pattern of SCR expression described 
herein. 

10 

4 . BRIEF DESCRIPTION OF THE FIGURES 
FIGS. 1A-B. Schematic of Arabidopsis root anatomy. 
FIG. 1A. Transverse section showing the four tissues, 
epidermis, cortex, endodermis and pericycle that surround the 

15 vascular tissue. In the longitudinal section, the 

epidermal/lateral root cap initials and the cortex/endodermal 
initials are shown at the base of their respective cell 
files. FIG. IB. Schematic of division pattern of the 
cortex/endodermal initial. The initial expands then divides 

20 anticlinally to reproduce itself and a daughter cell. The 

daughter then divides periclinally to produce the progenitors 
of the endodermis and cortex cell lineages. Abbreviations: 
C, cortex; Da, daughter cell,- E, endodermis; In, initial. 

FIGS. 2A-F. Phenotype of scr mutant plants. 

25 FIG. 2A. Shown left to right are 12 -day scr -2, scr-1 and 

wild-type seedlings grown vertically on nutrient agar medium. 
FIG. 2B. 21-day scr-2 mutant plants in soil. FIG. 2C. 
Transverse section through primary root of 7 -day scr-2. FIG. 
2D. Transverse section through primary root of 7-day wild- 

30 type (WT) . FIG. 2E. Transverse section through lateral root 
of 12 -day scr-1 mutant seedling. FIG. 2F. Transverse 
section through root regenerated from scr-1 callus. Bar, 50 
fim. Abbreviations: C, cortex; En, endodermis; Ep, epidermis; 
M, mutant cell layer; P, pericycle; V, vascular tissue. 

35 FIGS. 3A-F. Characterization of the cellular 

identity of the mutant cell layer. FIG. 3A. Endodermis- 
specific Casparian band staining of transverse sections 
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through the primary root of 7-day scr-1 mutant. (Note: the 
histochemical stain also reveals xylem cells in the vascular 
cylinder.) FIG. 3B. Casparian band staining of transverse 
sections through the primary root of 7-day wild-type (WT) . 
5 FIG. 3C. Immunostaining with the endodermis (and a subset of 
vascular tissue) specific JIM13 monoclonal antibodies on 
transverse root sections of scr-2 mutant. FIG. 3D. 
Immunostaining with JIM13 monoclonal antibodies on transverse 
root sections of WT. FIG. 3E. Immunostaining with the JIM7 

10 monoclonal antibody that stains all cell walls on transverse 
root sections of scr-2 mutant. FIG. 3F. Immunostaining with 
JIM7 monoclonal antibodies on transverse root sections of WT . 
Bar, 25 /im. Abbreviations are same as those for description 
of FIGS. 2A-2F and: Ca, casparian strip. 

15 FIGS. 4A-F. Immunostaining. FIG. 4A. 

Immunostaining with the cortex (and epidermis) specific CCRC- 
M2 monoclonal antibodies on transverse root sections of scr-1 
mutant. FIG. 4B. Immunostaining with CCRC-M2 antibodies on 
transverse root sections of scr-2 mutant. FIG. 3C. 

20 Immunostaining with CCRC-M2 antibodies on transverse root 
sections of wild-type (WT) . FIG. 4D . Immunostaining with 
the CCRC-M1 monoclonal antibodies (specific to a cell wall 
epitope found on all cells) on transverse root sections of 
scr-1. FIG. 4E. Immunostaining with CCRC-M1 antibodies on 

25 transverse root sections of scr-2. FIG. 4F. Immunostaining 
with CCRC-M1 antibodies on transverse root sections of WT. 
Bar, 3 0 fxm. Abbreviations are same as those for description 
of FIGS. 2A-2F. 

FIG. 5A-E. Structure of the Arabidopsis SCARECROW 

30 gene. FIG. 5A. Nucleic acid sequence and deduced amino acid 
sequence of the Arabidopsis SCR genomic region (SEQ ID NO:l) 
and (SEQ ID NO-.2) , respectively. Regulatory sequences 
including: (i) TATA box, (ii) ATG start codon, and (iii) 
potential polyadenylation sequence are underlined. Within 

35 the deduced amino acid sequence, homopolymeric repeats are 
underlined. FIG. 5B. Schematic diagram of genomic clone 
indicating possible functional motifs, T-DNA insertion sites 
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and subclones used as probes. Abbreviations: Q,S,P,T, region 
with homopolymeric repeats of these amino acids; b, region 
with similarity to the basic region of bZIP factors; I and 
II, regions with leucine heptad repeats; E, acidic region. 
5 FIG. 5C. Comparison of the charged region found in 

Arabidopsis SCR protein with that found in bZIP transcription 
factors, SCR bZIP-like domain (SEQ ID NO:3) , GCN4 (SEQ ID 
NO:4), TGA1 (SEQ ID NO:5), C-Fos (SEQ ID NO:6), c-JUN (SEQ ID 
N0:7), CREB (SEQ ID NO:8), Opaque-2 (SEQ ID NO:9), OBF2 (SEQ 

10 ID NO:10), RAF-1 (SEQ ID NO:ll). FIG. 5D . Translations of 
EST clones encoding putative peptide having similarities to 
the VHIID domain region of Arabidopsis SCR protein (SEQ ID 
NO-.12), F13896 (SEQ ID NO: 13), Z37192 (SEQ ID NO : 14 ) , and, 
Z25645 (SEQ ID NO: 15) are from Arabidopsis, T18310 (SEQ ID 

15 NO: 17) is from maize and D41474 (SEQ ID NO: 16) is from rice. 
FIG. 5E. The deduced amino acid sequence of the Arabidopsis 
SCARECROW gene (SEQ ID NO: 2) . 

FIGS. 6A-B. Expression of the Arabidopsis 
SCARECROW gene. FIG. 6A. Northern blot of total RNA from 

20 wild-type siliques (Si) , roots (R) , leaves (L) and whole 
seedlings (Sd) hybridized with Arabidopsis SCR probe a and 
with a probe from the Arabidopsis glutamine dehydrogenase 
(GDH) gene (Melo-Oliveira et al . , 1996, Proc. Natl. Acad. 
Sci. USA 93:4718-4723) as a control for RNA integrity. (GDH 

25 expression is lower in siliques than in vegetative tissues.) 
The 1.6 kb band corresponds to the GDH gene and the 
approximately 2.5 kb band corresponds to SCR. Ribosomal RNA 
is shown as a loading control. FIG. 6B. Northern blot of 
Arabidopsis wild-type, scr-1 and scr-2 total RNA, probed with 

30 Arabidopsis SCR probe "a" corresponding to a cDNA sequence 
shown in FIG. 5B, and with the GDH probe. In scr-2 mutant 
additional bands of 4.1 kb and 5.0 kb were detected. 

FIGS. 7A-G. In situ hybridization and enhancer 
trap analyses of Arabidopsis SCR expression. FIG. 7A. SCR 

35 RNA expression detected by in situ hybridization of SCR 

antisense probe to a longitudinal section through the root 
meristem. FIG. 7B . In situ hybridization of SCR antisense 
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probe to a transverse section in the meristematic region. 
FIG . 7C. In situ hybridization of SCR antisense probe to 
late torpedo stage embryo. FIG. 7D. Negative control in 
situ hybridization using a SCR sense probe to a longitudinal 
5 section through the root meristem. FIG. 7E. GUS expression 
in a whole mount in the enhancer trap line, ET199 in primary 
root tip. FIG. 7F. GUS expression in the ET199 line in 
transverse root section in the meristematic region. FIG. 7G. 
GUS expression in ET199 detected in a section through the 

10 root meristem. GUS expression is observed in the 

cortex/endodermal initial, and in the first cell in the 
endodermal cell lineage but not in the first cell of the 
cortex lineage. Expression in two endodermal layers is 
observed higher up in the root because the section was not 

15 median at that point. Bar, 50 /im. Abbreviations are same as 
those in the description of FIGS. 2A-2F. 

FIG. 8. Partial nucleotide sequence (SEQ ID NO: 18) 
and deduced amino acid sequence (SEQ ID NO: 19) of the 
Arabidopsis SCLa.4 gene. 

20 FIG . 9. Partial nucleotide sequence (SEQ ID NO: 20) 

and deduced amino acid sequence (SEQ ID NO: 21) of the 
Arabidopsis SCLa3 gene. 

FIG. 10. Partial nucleotide sequence (SEQ ID 
NO: 22) of the Arabidopsis SCLal gene. 

25 FIG. 11A. Nucleotide sequence (SEQ ID NO:24) and 

deduced amino acid sequence (SEQ ID NO: 25) of the maize Zm- 
Scll fragment. 

FIG. 11B. Partial nucleotide sequence (SEQ ID 
NO: 25) and deduced amino acid sequence (SEQ ID NO: 26) of the 

30 maize SCLml gene (Zm-Scl2) . 

FIG. 12A-B. Nucleotide sequence of rice SCLo3 EST 
clone. FIG. 12A. Sequence of 5' end of EST clone (SEQ ID 
NO:28) . FIG. 12B. Sequence of 3' end of EST clone (SEQ ID 
NO:29) . 

35 FIGS. 13A-F. Comparison of the amino acid sequence 

of members of the SCARECROW family of genes. Conserved 
Motifs I through VI are indicated by dashed line above the 
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aligned sequences. Consensus sequences are shown in bold. 
See Table 1 for the identity and sequence identifier number 
of each of the sequences shown in this Figure. 

FIG. 14. Restriction map of the approximately 8.8 
5 kb Eco RI insert DNA of lambda clone, t643, containing the 
Arabidopsis SCR gene. The locations of the approximately 5.6 
kb Hindlll-SacI fragment subcloned in plasmid LIG 1- 
3/SAC+MoB 2 1SAC, and the SCR coding region are indicated 
below the restriction map. The location of the translational 

10 initiation site of the SCR gene is at the Nco I site at the 
left end of the indicated coding region. The SCR coding 
sequence begins at the translation initiation site and 
extends approximately 1955 nucleotides to its right. E. coli 
DH5a containing plasmid pLIGl-3 /SAC+MoB 2 IS AC, has the ATCC 

15 accession number 98031. 

FIGS. 15A-S. Comparison of the partial and 
complete amino acid sequences of several plant members of the 
SCARECROW family of genes. The amino acid sequences are 
aligned in a manner that maximizes amino acid sequence 

20 similarity and identity among SCR family members. Each 
sequence shown is continuous except where noted otherwise; 
the dots are inserted between two sequence segments in order 
to align homologous segments. "X" in the middle of a 
sequence indicates ambiguity in the corresponding nucleotide 

25 sequence and, possible termination of the ORF at the "X" 
residue site. "X" at the end of a sequence indicates 
termination of the ORF at the "X" residue site. The 
numbering of the amino acid residues is shown at the bottom 
of each figure and is based on the Arabidopsis SCR amino acid 

30 sequence. Conserved Motifs I through VI are indicated by the 
various dashed lines above the figures. The new and old 
names of the family members are shown in FIG. 15A. The 
sequences of SCR, Tfl and Tf4 are of the complete SCR 
protein. See Table 1 for the identity and the sequence 

35 identifier number of each sequence shown in these figures. 

FIGS. 16A-M. The partial nucleotide sequences of 
several plant members of the SCARECROW family of genes. "N" 
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indicates an unknown base. See Table 1 for the identity and 
the sequence identifier number of each sequence shown in 
these figures. 

FIG. 17A. The partial nucleotide sequence (SEQ ID 
5 NO: 66) of the maize ZCR gene. 

FIG. 17B. The partial amino acid sequence (SEQ ID 
NO: 67) of the maize ZCR gene. The underlined sequence shares 
approximately 80% sequence identity with a corresponding 
sequence of Arabidopsis SCR protein. 

10 FIG. 18. Comparison of the partial amino acid 

sequences of several SCR ortholog sequences amplified from 
the genomes of carrot , soybean and spruce . The SCLdl and 
SCLpl sequences each were obtained by PCR amplification using 
a combination of IF and 1R primers. The SCLgl sequence was 

15 obtained by PCR amplification using a combination of IF and 
WP primers. See, for example. Section 5.1.1., infra. The 
amino acid sequences are aligned in a manner that maximizes 
amino acid sequence identity and similarity amongst these 
sequences. Each sequence shown is continuous except where 

20 noted otherwise; the dashes are inserted between two sequence 
segments in order to allow alignment of homologous segments, 
"x" in the middle of a sequence indicates ambiguity in the 
corresponding nucleotide sequence and, possible termination 
of the ORF or existence of an intron at the "x" residue site. 

25 See Table 1 for the identity and the sequence identifier 
number of each sequence shown in this figure. 

FIG. 19. Comparison of promoter activities in 
transgenic lines and roots. Panel a. A stably transformed 
line containing four copies of the B2 subdomain of the 35S 

30 promoter of CaMV upstream of GUS (Benfey et al . , 1990) . GUS 
is expressed in the root tip. Panel b. Roots emerging from 
callus transformed with four copies of the B2 subdomain of 
the 35S promoter fused to GUS. GUS expression can be seen in 
the emerging root tips (arrows) . Panel c. Higher 

35 magnification of a root emerging from the callus in panel b. 
GUS is clearly restricted to the root tip. The morphology of 
roots regenerated from calli often appears abnormal. Panel 
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d. A transgenic plant regenerated from the calli and roots 
shown in panel b. GUS expression in this plants appears to 
be similar to that of the original line shown in panel a. 
Panel e. ET199, a stably transformed line that contains an 
5 enhancer trapping construct with a minimal promoter fused to 
the GUS coding region inserted 1 kb upstream from the SCR 
coding region. GUS expression is primarily in the endodermal 
layer of the root. Panel f. Roots emerging from calli 
transformed with the SCR promoter: : GUS construct. Expression 

10 of the GUS gene appears to be limited to an internal layer 
(arrows) . Panel g. SCR promoter: :GUS transformed root in 
liquid culture. Roots shown in panel f were excised and 
transferred to liquid cultures. GUS expression is primarily 
found in the endodermal layer as in ET199. The expression of 

15 GUS in the quiescent center, as seen here, is also sometimes 
observed in ET199. Bar, 50/zm. 

FIG. 20. Analysis of SCR promoter activity in the 
scr mutant background. Panel a. Roots emerging from scr 
calli transformed with the SCR promoter :: GUS construct. 

20 Roots regenerated from scr calli are very short. GUS 

expression appears to be limited to an internal layer of the 
root (arrows) . Panel b. Root regenerated from transformed 
scr calli and transferred to liquid culture. The scr 
phenotype, a single layer between the epidermis and 

25 pericycle, is easily seen. GUS expression is limited to this 
mutant layer. E, Epidermis. M, Mutant Layer. P, Pericycle. 
Bar, 50/xm. 

FIG. 21. Molecular Complementation of the scr 
mutant. Panels a, c and e. scr transformed with the SCR 

30 promoter : :GUS construct. Panels b, d and f. scr transformed 
with the SCR promoter :: SCR coding region construct. Panels a 
and b. Roots emerging from scr calli. Arrows point to 
several very short roots among many fine root hairs in the 
scr calli transformed with the SCR promoter :: GUS construct. 

35 In contrast, roots from scr calli transformed with the SCR 
promoter :: SCR coding region construct appeared to be 
wild-type in length, suggesting molecular complementation by 
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the transgene. Panels c and d. Transgenic roots in liquid 
culture. The scr roots transformed with the SCR 
promoter: :GUS construct appeared short, while those 
transformed with the SCR promoter :: SCR coding region 
5 construct appeared of wild- type length. Panels e and f. 
Transverse sections through roots emerging from calli. 
Whereas there is only a single cell layer between the 
epidermis and stele in the SCR promoter: :GUS transformed 
root, the radial organization of the root transformed with 

10 the SCR promoter :: SCR coding region appeared identical to 
wild-type, with both cortex and endodermal layers. E, 
epidermis. M, mutant layer. C, cortex. En, Endodermis, P, 
Pericycle. Bar, 50/m 

FIGS. 22A-F. Expression of ZCR in maize root tips. 

15 FIG. 22A. Expression of ZCR is in the endodermal layer and 
extends down through the region of the quiescent center. 
FIGS. 22B-C. Higher magnification showing expression in a 
single cell layer through the quiescent center. FIG. 22D. 
Expression of ZCR in the maize embryonic root. FIG. 22E. 

20 Higher magnification showing expression in the embryonic 
root. FIG. 22F. Expression of ZCR in the maize lateral 
root . 

FIGS. 23 A-B. Root apical meristems of maize and 
Arabidopsis. Both show a type of a closed meristem in which 

25 all files of cells converge onto a pole at the root apex, 

making the boundary between the root proper and the root cap 
discrete. FIG. 23A. A schematic representation of the 
monocotyledonous closed-type of root apical meristem of 
maize. FIG. 23B. A schematic representation of the 

30 dicotyledonous closed-type of root apical meristem of 
Arabidopsis . 

FIGS. 24A-G. Embryo development in Maize. 
FIG. 24A. Three-celled embryo establishing the initial 
asymmetry and showing the first division of a terminal cell. 

3 5 FIGS. 24B-C. Embryos showing embryo proper and suspensor. 
FIGS. 24D-E. Embryos showing radial asymmetry and the 
initial development of shoot and root apical meristems. 
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while the nucleot: 
FIG. 26. 



Embryos showing the elaborate organization of 
ot apical meristeras. 

e Scarecrow gene. The nucleotide and 
d amino acid segunce of the maize scarecrow gene (ZCR) 

des are numbered on the left. 
Amino acid sequence alignment of maize 
ZCR and Arabidopsis SCR. Identical residues are marked by 
asterisks. In addition, three copies of an LXXLL motif are 
10 underlined. 

FIGS. 27A-G. Maize Scarecrow gene expression 
during regeneration of the root apex following excision of 
the QC. FIGS. 27A-B. Immediately after removal of the root 
cap and excision of the QC, no significant alteration in the 
15 expression pattern was observed. FIGS. 27C-D. Maize 
expression pattern 24 hours following excision of the QC. 
These figures show isolated expression of the gene between 
cell files. FIG. 27E. Expression 48 hours following 
excision of the QC. This figure shows that the root tip has 
20 regained much of its normal shape, although the cell files 
have not organized into the converging files seen in normal 
roots. FIG. 27F. Expression 72 hours following excision of 
the QC. At this stage, the expression pattern resembles that 
found in the unexcised root. FIG. 27G. Expression 96 and 
25 100 hours following excission of the QC. At this stage, the 
expression pattern is similar to that in the primary root. 

FIGS. 28A-AH. The partial nucleotide and amino 
acid sequences of Arabidopsis EST's that encode members of 
the SCASBCRCW-like (SCL) gene family. "N" indicates an 
30 unknown base. See Table 2 for the identity and the sequence 
identifier number of each sequence shown in these figures. 

FIG. 29. Alignment of the Arabidopsis GRAS gene 
products. The highly conserved region of the GRAS products 
can be divided into five recognizable motifs, indicated in 
35 the figure. See also, for example, Section 5.1.5., infra. 
The absolutely conserved residues within the VHIID and SAW 
motifs are highlighted in bold, as are the hydrophobic 



WO 00/53723 



PCT/US00/05875 



residues of the leucine heptads, the P-F-Y-R-E residues of 
the PFYRE motif, and the two short sequences that define the 
end of the VHIID motif and the beginning of the PFYRE motif. 
The ® symbol in the alignment indicates the location of an 
5 apparent insertion in the SCL3 gene. The deduced amino acid 
sequence of the insertion is shown at the bottom of the 
figure . 

FIG. 30. RNA Gel Blot. mRNA from siliques (Si) 
and 14 day old shoots (Sh) and roots (R) was isolated and 

10 analyzed by RNA gel blot hybridization with specific 

antisense digoxygenin- labeled probes. The SCLs analyzed are 
all expressed within the roots, and many of them are 
expressed in all of the organs tested. As the amount of • 
mRNA loaded on the gels and the exposure times for all of 

15 these blots varied, direct comparisons of the levels of 
expression are not possible. Detection of SCL1 , however, 
required significantly shorter exposures than the others, and 
SCLS, SCL1 and SCL9 required significantly longer exposures 
and more mRNA. A representative ethidium bromide-stained RNA 

20 gel is shown below as a loading control. 

FIG. 31. In situ Hybridizations with SCR and SCL3 . 
Transverse sections (a, b, and d) and a longitudinal section 
(c) of 7 day old roots were hybridized with either an 
antisense SCR riboprobe (a) , an antisense SCL3 riboprobe (b 

25 and c) or a sense SCL3 riboprobe (d) . Strong signal is 

observed in the endodermis with the antisense SCR probe and 
the antisense SCL3 probe, but not with the sense SCL3 probe. 
Scale bars in (a) and (c) are both 25 mm. The magnification 
is the same in panels (a) , (b) , and (d) . 

3 0 FIG. 32. RNA Blot Analysis. An RNA blot analysis 

in which either total RNA or poly-A selected RNA from roots 
(R) and shoots (S) were probed with the full-length ZCR cDNA. 
The hybridizing band is approximately 2.6 kilobases. 

FIG. 33. CBPBTT44 Partial cDNA and Amino Acid 

35 Sequence. The partial nucleotide and amino acid sequence of 
CBPBTT44, a closely related gene to the maize ZCR gene. 
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FIG. 34. Alignment of the Arabidopsis SCR, the 
maize ZCR and the CBPBTT44 amino acid sequence. As shown in 
bold, all three genes contain the leucine heptad repeats. 
The alignment further shows that all three genes share a high 
5 degree of homology. 

FIG. 35. Southern Blot Analysis. A Southern of 
maize genomic DNA probed with (left) the maize ZCR cDNA, 
wherein the W H" lane represents DNA digested with Hindi I I and 
the "RV" lane represents DNA digested with EcoRV restriction 
10 enzymes; (right) gene-specific probes (A) maize ZCR cDNA for 
comparison; (B) maize ZCR gene-specific probe and (C) 
CBPBTT44 gene- specif ic probe. The results demonstrate that 
CBPBTT44 is the source of the other hybridizing bands picked 
up by the maize ZCR cDNA. 

15 

5 . DETAILED DESCRIPTION OF THE INVENTION 

The invention relates to the SCARECROW (SCR) gene; 
SCARECROW- 1 ike (SCL) genes, SCR gene products, including, but 
not limited to, transcriptional products such as mRNAs, 

2 0 

antisense and ribozyme molecules, translational products such 
as the SCR protein, polypeptides, peptides and fusion 
proteins related thereto; antibodies to SCR gene products; 
SCR regulatory regions; and the use of the foregoing to 
improve agronomically valuable plants. 

25 In summary, the data described herein show the 

identification of SCR, a gene involved in the regulation of a 
specific asymmetric division, in controlling gravitropic 
response in aerial structures, and in controlling pattern 
formation in roots. Sequence analysis shows that the SCR 

30 protein has many hallmarks of transcription factors. In situ 
and marker line expression studies show that SCR is expressed 
in the cortex/endodermal initial of roots before asymmetric 
division occurs, and in the quiescent center of regenerating 
roots. Together, these findings indicate that the SCR gene 

3 ^ regulates key events that establish the asymmetric division 
that generates separate cortex and endodermal cell lineages, 
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and that affect tissue organization of roots. The 
establishment of these lineages is not required for cell 
differentiation to occur, because in the absence of division, 
the resulting cell acquires mature characteristics of both 
5 cortex and endodermal cells. However, it is possible that 
SCR functions to establish the polarity of the initial before 
cell division, or that it is involved in generating an 
external polarity that has an effect on asymmetric cell 
division. 

10 Genetic analysis indicates that SCR expression 

affects gravitropism of plant stems, hypocotyls and shoots. 
This indicates that SCR is expressed also in these aerial 
structures of plants. 

The SCR genes and promoters of the present 

15 invention have a number of important agricultural uses. The 
SCR promoters of the invention may be used in expression 
constructs to express desired heterologous gene products in 
the embryo, root, root nodule, and starch sheath layer in the 
stem of transgenic plants transformed with such constructs. 

2 0 For example, SCR promoters may be used to express disease 
resistance genes such as lysozymes, cecropins, maganins or 
thionins for anti -bacterial protection, or the pathogenesis- 
related (PR) proteins such as glucanases and chitinases for 
anti-fungal protection. SCR promoters also may be used to 

25 express a variety of pest resistance genes in the 

aforementioned plant structures and tissues. Examples of 
useful gene products for controlling nematodes or insects 
include Bacillus thuringiensis endotoxins, protease 
inhibitors, collagenases , chitinase, glucanases, lectins and 

30 glycosidases . 

Gene constructs that express or ectopically express 
SCR, and the SCR- suppression constructs of the invention, may 
be used to alter the root and/or stem structure, and the 
gravitropism of aerial structures of transgenic plants. 

35 Since SCR regulates root cell divisions, overexpression of 
SCR can be used to increase division of certain cells in 
roots and thereby form thicker and stronger roots. Thicker 
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and stronger roots are beneficial in preventing plant 
lodging. Conversely, suppression of SCR expression can be 
used to decrease cell division in roots and thereby form 
thinner roots. Thinner roots are more efficient in uptake of 
5 soil nutrients. Since SCR affects gravitropism of aerial 
structures, overexpression of SCR may be used to develop 
"straighter" transgenic plants that are less susceptible to 
lodging . 

Further, the SCR gene sequence may be used as a 
10 molecular marker for a quantitative trait, e.g., a root or 
gravitropism trait, in molecular breeding of crop plants. 

For purposes of clarity and not by way of 
limitation, the invention is described in the subsections - 
below in terms of (a) SCR genes and nucleotides; (b) SCR gene 
15 products; (c) antibodies to SCR gene products; (d) SCR 

promoters and promoter elements; (e) transgenic plants which 
ectopically express SCR; (f) transgenic plants in which 
endogenous SCR expression is suppressed; and (g) transgenic 
plants in which expression of a transgene of interest is 
20 controlled by the SCR promoter. 

5.1 . SCR GENES 

The SCARECROW genes and nucleotide sequences of the 
invention include: (a) a gene listed below in Tables 1 or 2 

25 {hereinafter, a gene comprising any one of the nucleotide 
sequences shown in FIG. 5A, FIG. 8, FIG. 9, FIG. 10, FIGS. 
11A-B, FIGS. 12A-B, FIGS. 16A-M, FIG. 17A, FIG. 25 or FIGS. 
28A-AH, or a segment of such nucleotide sequences) , or as 
contained in the clones described herein and deposited with 

30 the ATCC (see Section 13, infra) ; (b) a nucleotide sequence 
that encodes a protein comprising any one of the amino acid 
sequences shown in FIG. 5A, FIG. 5D, FIG. 5E, FIG. 8, FIG. 9, 
FIGS. 11A-B, FIGS. 13A-F, FIGS. 15A-S, FIG. 17B, FIG. 18 or 
FIG. 25, or a segment of such amino acid sequences, or that 

35 is encoded by any one of the genes and/or nucleotide 

sequences listed by their sequence identifier numbers in 
Tables 1 or 2 , or any segment of such genes and/or nucleotide 
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sequences, or contained in any one of the clones described 
herein and deposited with the ATCC (see Section 13, infra); 
(c) any gene comprising a nucleotide sequence that hybridizes 
to the complement of any one of the genes and/or nucleotide 
5 sequences listed by their sequence identifier numbers in 

Tables 1 or 2 , or any segment of such genes and/or nucleotide 
sequences, or as contained in any one of the clones described 
herein and deposited with the ATCC, under highly stringent 
conditions, e.g., hybridization to filter-bound DNA in 0.5 M 

10 NaHP0 4 , 7% sodium dodecyl sulfate (SDS) , 1 mM EDTA at 65°C, 
and washing in 0.1xSSC/0.1% SDS at 68 °C (Ausubel F.M. et al . , 
eds., 1989, Current Protocols in Molecular Biology, Vol. I, 
Green Publishing Associates, Inc., and John Wiley & sons, * 
Inc., New York, at p. 2.10.3) and that encodes a gene product 

15 functionally equivalent to SCR gene product encoded 

completely or partly by any one of the genes and/or sequences 
listed in Tables 1 or 2 or any segment of such genes and 
nucleotide sequences, or as contained in any one of the 
clones deposited with the ATCC; (d) any gene comprising a 

20 nucleotide sequence that hybridizes to the complement of any 
one of the sequences listed by their sequence identifier 
numbers in Tables 1 or 2 , or any segment of such nucleotide 
sequences, or as contained in any one of the clones described 
herein and deposited with the ATCC, under less stringent 

25 conditions, such as moderately stringent conditions, e.g., 
washing in 0.2xSSC/0.1% SDS at 42°C (Ausubel et al . , 1989, 
supra) , and which encodes a functionally equivalent SCR gene 
product; (e) any gene comprising a nucleotide sequence that 
hybridizes to the complement of any one of the sequences 

30 listed by their sequence identifier numbers in Tables 1 or 2 
or any segment of such nucleotide sequences, or as contained 
in any one of the clones described herein and deposited with 
the ATCC, under the following low stringency conditions: pre- 
hybridization in hybridization solution (HS) containing 43% 

35 formamide, 5xSSC, 1% SDS , 10% dextran sulfate, 0.1% sarkosyl, 
2% block (Genius kit, Boehringer-Manriheim) , followed by 
hybridization overnight at 30 to 33°C using as a probe a DNA 
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molecule of approximately 1.6 kb of SEQ ID NO : 1 at a 
concentration of 20 ng/ml, followed by washing in 2xSSC/0.1% 
SDS two times for 15 minutes at room temperature and then two 
times at 50°C, and which encodes a functionally equivalent 
5 SCR gene product; and/or (f) any gene comprising a nucleotide 
sequence that encodes a polypeptide or protein containing the 
consensus sequence for SCR (i.e., MOTIF III or VHIID) shown 
in FIGS. 13B-D or a segment of such polypeptide or protein. 
The partial and complete nucleotide and amino acid sequences 
10 of SCR genes and encoded proteins and polypeptides included 
in the invention are listed in Tables 1 or 2 below. 
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Table 1. 

New Name Old Name 
ARABIDOPSIS 

SCLal 1110 



SCha.2 
SCLa3 

SCLa4 
SCLa5 
SCLa6 
SCLal 



SCLalO 
SCLal 1 
SCLal 2 

SCLal 3 
SCR 



Tf4 
3935 

4818 
4871 
12398 
3635 



SCLa8 Tfl 
SCLa9 10 964 



11261 
18652 
23196 

33/08 
Scr 



SCR ORTHOLOGS AND PARALOGS 
EST Clone 1 



SEQ ID NOs 
Nucleotide 3 Amino Acid 



Z25645/33772 

Z34599 

Z37192/1 
N96166 

F13896/7 

F13949 

R29793 

T21627 
H76979 
N96767 



T46205 
N96653 

T78186 
T44774 

T76483 

N37425 

W43803 

W435138 

AA042397 



(9468) 
(21711) 



22 



RICE 
SCLol 
SCLo2 



SCLo3 
SCLoA 

MAIZE 
ZCR 
SCLml 

BRASS I CA 



713 
2504 



3989 
11846 



N.A. 

18310 



D15490 

D40482 
D40607 
D40800 
D41389 

D41474 

C20324 



N.A. 

T18310 
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5 



SEQ ID NOs 

New Name. Old Name EST Clone 1 Nucleotide 3 Amino Acid 

SCLbl 174 H74669 -- 42 
CARROT 

SCLdl N.A. N.A. 60 61 

SOYBEAN 

SCLgl N.A. N.A. 62 63 

SPRUCE 

SCLpl N.A. N.A. 64 65 



1 Each EST clone is identified by its GenBank accession, 
number. Each EST clone corresponds to a deposit of a 
cDNA sequence that matches a part of the nucleotide 
15 sequence of the corresponding SCR ortholog or paralog. 



N.A. = not applicable. 



The partial or complete nucleotide sequence of the SCR 
orthologs and paralogs listed here are shown in FIGS. 
5A, 8, 9, 10, 11A-B, 12A-B, 16A-M, 17A and 25. 



Contains the complete coding sequence of Arabidopsis SCR 
gene . 

Contains the complete amino acid sequence of Arabidopsis 
25 SCLa2, SCLa8 , or SCR protein. 
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Designation 


Accession Numbers 


Complete EST 
Sequence 


Map Position 


SCL1 


225645/33772. 
BIOolo. tJiioob 


AF0360300 


1: m235-g3829 (Rl) 


GAI 


Z34183, Z34599. 
T22782, Y11337, 
Y15193. B62171 




1: ve006-veOO7 
(CIC3G6. 4H9, and 11 C3) 


SCL3 


Z37192/Z37191. 
N96166, B20233, 
B18969 


AF0360301 


1: m213_ 

(CIC 1G8, 4H4. 8G4) 


SCL4 


Z46S50. Z38048. 

Z38085, B22400, 

B23696 

G: AB010700 




5 (genomic clone) 


SCL5 


F13896/F 13897. 
AA395075 


AF0360302 


1: m213 (Rl) 


SCL6 


F 13949 

G:AC004708. 

(WASHU003) 


AF0360303 


4: mi51 

(C(C 2C7. 5B11, 5C11. 10C8) 
(genomic clone) 


SCL7 


R29793 


AF0360304 


3: COs4. m457 
(CIC 8E2. 8E1, 901) 


SCL8 


T21627. H76979. 
N96767, T43670 
AA395639. B77404 


AF0360305 


5: PAP003 
(CIC 11F10) 


SCL9 


T76186. T44774 

G.AC004684. 

B25776 


AF0360306 


2: ve018-nga168 
(CIC 10F12) 
(qenomic clone) 


RGA 


T45793. T46205. 
N96653. Y11336, 
Y15194 




2: ve0l2 

(CIC7C11. 2F4, and 6G2) 


SCL11 


T76483, AA394557. 
AA605493 


AF0360307 


NP 


SCL12 


F15146 






SCL13 
(VHS4) 


F15454, N37425. 
AA720344. 
R29917 
G: Z97343 


AF0360308 


4: g4539-mi112 

(CIC 403. 6G4. 2B8, 5E12. 

7G8, 12B9) 

(genomic clone) 


SCL14 


W43803, W43538, 
AA042397 


AF0360309 


NP 


WHSS) 


N65163 
G: Z99708 






SCL16 


G: AB007645 




5 (genomic clone) 


HGL 


AJ224957 






SCL18 


B10115. B30030. 
G.AC002328 




1: mi209.nga280.nga1 28 
(BAC F20N2) 
(qenomic done) 


SCL19 


226055. B62171. 
B62460 






SCR 


U62798 




3: ve042-ve022 
(CIC 11 G5. 907) 



Tabic 2 
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Functional equivalents of the SCR gene product 
include any plant gene product that regulates plant embryo or 
root development, or, preferably, that regulates root cell 
division or root tissue organization, or affects gravitropism 
5 of plant aerial structures (e.g., stems and hypocotyls) . 
Functional equivalents of the SCR gene product include 
naturally occurring SCR gene products, and mutant SCR gene 
products, whether naturally occurring or engineered. 

The invention also includes nucleic acid molecules, 

10 preferably DNA molecules, that hybridize to, and are 

therefore the complements of the nucleotide sequences (a) 
through (f ) , in the first paragraph of this section. Such 
hybridization conditions may be highly stringent, less highly 
stringent, or low stringency as described above. In 

15 instances wherein the nucleic acid molecules are 

oligonucleotides ("oligos"), highly stringent conditions may 
refer, e.g., to washing in 6xSSC/0.05% sodium pyrophosphate 
at 37°C (for 14-base oligos) , 48°C (for 17-base oligos) , 55°C 
(for 20-base oligos) , and 60°C (for 23 -base oligos) . These 

2 0 nucleic acid molecules may act as SCR antisense molecules, 
useful, for example, in SCR gene regulation and/or as 
antisense primers in amplification reactions of SCR gene 
and/or nucleic acid sequences. Further, such sequences may 
be used as part of ribozyme and/or triple helix sequences, 

25 also useful for SCR gene regulation. Still further, such 
molecules may be used as components in probing methods 
whereby the presence of a SCARECROW allele may be detected. 

The invention also includes nucleic acid molecules, 
preferably DNA molecules, which are amplified using the 

30 polymerase chain reaction under conditions described in 
Section 5.1.1., infra, and that encode a gene product 
functionally equivalent to a SCR gene product encoded by any 
one of the genes and sequences listed in Tables 1 or 2 or as 
contained in any one of the clones described herein and 

35 deposited with the ATCC. 

The invention also encompasses (a) DNA vectors that 
contain any of the foregoing gene and/or coding sequences 
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and/or their complements (i.e., antisense or ribozyme 
molecules) ; (b) DNA expression vectors that contain any of 
the foregoing gene and/or coding sequences operatively 
associated with a regulatory element that directs the 
5 expression of the gene and/or coding sequences; and (c) 
genetically engineered host cells that contain any of the 
foregoing gene and/or coding sequences operatively associated 
with a regulatory element that directs the expression of the 
gene and/or coding sequences in the host cell. As used 

10 herein, regulatory elements include, but are not limited to, 
inducible and non- inducible promoters, enhancers, operators 
and other elements known to those skilled in the art that 
drive and regulate expression. 

The invention also encompasses nucleotide sequences 

15 that encode mutant SCR gene products, peptide fragments of 
the SCR gene product, truncated SCR gene products and SCR 
fusion proteins. These gene products include, but are not 
limited to, nucleotide sequences encoding mutant SCR gene 
products; polypeptides or peptides corresponding to one or 

20 more of the Motifs I-VI as shown in FIGS. 13A-F and FIGS. 
15A-S, or the bZIP, VHIID, or leucine heptad domains of the 
SCR, or portions of these motifs and domains; truncated SCR 
gene products in which one or more of the motifs or domains 
is deleted, e.g., a truncated, nonfunctional SCR lacking all 

25 or a portion of the Motifs I-VI as shown in FIGS. 13A-F and 
FIGS. 15A-S, or the bZIP, VHIID, or leucine heptad domains of 
the SCR. Nucleotides encoding fusion proteins may include, 
but are not limited to, full length SCR, truncated SCR or 
peptide fragments of SCR fused to an unrelated protein or 

30 peptide, such as, for example, an enzyme, fluorescent protein 
or luminescent protein which can be used as a marker. 

In particular, the invention includes, for example, 
fragments of SCR genes encoding one or more of the following 
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domains as shown in FIG. 5E: amino acids 1-264, 265-283, 287- 
316, 410-473, 436-473, and 473-653. 

In addition to the gene and/or coding sequences 
described above, homologous SCR genes, and other genes 
5 related by DNA sequence, may be identified and may be readily 
isolated, without undue experimentation, by molecular 
biological techniques well known in the art. More 
specifically, such homologs include, for example, paralogs 
(i.e., members of the SCR gene family occurring in the same 

10 plant) as well as orthologs (i.e., members of the SCR gene 
family which occur in a different plant species) of the 
Arabidopsis SCR gene. 

A specific embodiment of a SCR gene and coding 
sequence of the invention is Arabidopsis SCR (FIGS. 5A and 

15 5E) . Other specific embodiments include the various SCR 
genes and coding sequences listed in Tables 1 or 2 , supra. 

Methods for isolating SCR genes and coding 
sequences are described in detail in Section 5.2, below. 

SCR genes share substantial amino acid sequence 

20 similarities at the protein level and nucleotide sequence 
similarities in their encoding genes. The term 
"substantially similar" or "substantial similarity" when used 
herein with respect to two amino acid sequences means that 
the two sequences have at least 75% identical residues, 

25 preferably at least 85% identical residues and most 

preferably at least 95% identical residues. The same term 
when used herein with respect to two nucleotide sequences 
means that the two sequences have at least 70% identical 
residues, preferably at least 85% identical residues and most 

30 preferably at least 95% identical residues. Determining 
whether two sequences are substantially similar may be 
carried out using any methodologies known to one skilled in 
the art, preferably using computer assisted analysis. For 
example, the alignments shown herein were initially 

35 accomplished by a BLAST search (NCBI using the BLAST network 
server) . The final alignments of SCR family members were 
done manually. 
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Moreover, SCR genes show highly localized 
expression in embryos and, particularly, roots. Such 
expression patterns may be ascertained by Northern 
hybridizations and in situ hybridizations using antisense 
5 probes . 

5.1.1. ISOLATION OF SCR GENES 

The following methods can be used to obtain SCR and 
SCL genes and coding sequences from a wide variety of plants, 

10 including, but not limited to, Arabidopsis thaliana, Zea 
mays, Nicotiana tabacum, Daucus carota, Oryza, Glycine max, 
Lemna gibba and Picea abies. 

Nucleotide sequences encoding an SCR gene, an SCI, 
gene or portions thereof may be obtained by PCR amplification 

15 of plant genomic DNA or cDNA. Useful cDNA sources include 
"free" cDNA preparations (i.e., the products of cDNA 
synthesis) and cloned cDNA in cDNA libraries. Root cDNA 
preparations or libraries are particularly preferred. 

The amplification may use, as the 5 ' -primer (i.e., 

20 forward primer) , a degenerate oligonucleotide that 

corresponds to a segment of a known SCR amino acid sequence, 
preferably from the amino- terminal region. The 3 ■ -primer 
(i.e., reverse primer) may be a degenerate oligonucleotide 
that corresponds to a distal segment of the same known SCR 

25 amino acid sequence (i.e., carboxyl to the sequence that 

corresponds to the 5 1 -primer) . For example, the amino acid 
sequence of the Arabidopsis SCR protein (SEQ ID NO : 2 ) may be 
used to design useful 5' and 3' primers. Preferably, the 
primers corresponds to segments in the Motif III or VHIID 

30 domain of SCR protein (see FIGS. 13B-D and FIGS. 15K-L) . The 
sequence of the optimal degenerate oligonucleotide probe 
corresponding to a known amino acid sequence may be 
determined by standard algorithms known in the art . See for 
example, Sambrook et al . , Molecular Cloning-. A Laboratory 

35 Manual , 2nd ed., Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, Vol 2 (198 9) . 
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Further, for amplification from cDNA sources, the 
3 ' -primer may be an oligonucleotide comprising an 3 1 
oligo(dT) sequence. The amplification also may use as 
primers nucleotide sequences of SCR and SCL genes or coding 
5 sequences (e.g., any one of the scr sequences and EST 
sequences listed in Table 1 and Table 2) . 

PCR amplification can be carried out, e.g., by use 
of a Perkin-Elmer Cetus thermal cycler and Taq polymerase 
(Gene Amp") . One can choose to synthesize several different 

10 degenerate primers for use in the PCR reactions. It also is 
possible to vary the stringency of hybridization conditions 
used in priming the PCR reactions, to allow for greater or 
lesser degrees of nucleotide sequence similarity between the 
degenerate primers and the corresponding sequences in the 

15 cDNA library. One of ordinary skill in the art will know 
that the appropriate amplification conditions and parameters 
depend, in part, on the length and base composition of the 
primers and that such conditions may be determined using 
standard formulae. Protocols for executing all PCR 

20 procedures discussed herein are well known to those skilled 
in the art, and may be found in references such as Gelfand, 
1989, PCR Technology, Principles and Applications for DNA 
Amplification , H . A . Erlich, ed . , Stockton Press, New York; 
and Current Protocols In Molecular Biology , Vol. 2, Ch. 15, 

25 Ausubel et al . , eds 1988, New York, Wiley & Sons, Inc. 

A PCR amplified sequence may be molecularly cloned 
and sequenced. The amplified sequence may be utilized as a 
probe to isolate genomic or cDNA clones of a SCR gene, as 
described below. This, in turn, will permit the 

30 determination of a SCR gene's complete nucleotide sequence, 
including its promoter, the analysis of its expression, and 
the production of its encoded protein, as described infra. 

In a preferred embodiment, PCR amplification of SCR 
gene and/or coding sequences can be carried out according to 

35 the following procedure: 
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PRIMERS : 
Forward: 
Name : 

A. A. code: 
5 DNA Sequence : 

Name : 

A. A. code: 
DNA Sequence : 



(23-mer, 2 inosines, 64- 



SCR5AII 
HFTANQAI 

5 ' CAT/C TTT/C ACI GCI AAT/C CAA/G GCN AT 3 • 

SCR5B (29-mer, 1 inosine, 144-mix) 

VHIID(L/F)D 

5' ACGTCTCGA GTI CAT/C ATA/C/T ATA/C/T GAT/C 
TTN GA 3 1 



Name : 

A . A . code : 
DNA Sequence : 



IF 

LQCAEAV 

(T/OTI CA(A/G) TG(T/C GCI GA (A/G) GCN GT 



SCR3AII (23-mer, 2 inosines, 128-mix) 

PGGPP (H/N/K) (V/L/F)R' 

5' CG/T CCA/C GTG/T TGG IGG ICC NCC NGG 3' 



Name : 

A. A. code: 
15 DNA Sequence : 

Name : 

A. A. code: 
DNA Sequence : 

Name : 
20 A. A. code: 

DNA Sequence: 

I = inosine 
N = A/C/G/T 

Useful primer combinations include the following: 
SCR5AII+SCR3AI I ; SCR5B+SCR3AII ; IF+IR; and IF+4R 



AFQVFNGI 

AT ICC (A/G)TT (A/GJAA IAC (C/T) TG (A/G) AA NGC 



QWPGLFHI 

AT (A/G)TG (A/G) AA IA(A/G) NCC IGG CCA (C/T)TG 



PCR: 

Reaction 



ixture (volume 50 nl) : 



-5 /zl 10X amplification buffer containing Mg (Boehringer- 
Mannheim) 

-1 ill 10 mM dNTP's 

-1 }il forward primer (stock concentration: 8 0 pmol/^1) 
-1 /il reverse primer (80 pmol//zl) 
-DNA (100-300 ng) . 

Begin reaction with "hot start" in which the enzyme is added 
.to the mix only after a brief denaturation at a high 
temperature (8 0°C) 



Cycles : 
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94 °C 30 sec - brief denaturation (to prevent non-specific 
priming) 

80 °C 5 min - apply the enzyme to the tubes (3 0 tubes/round 

at maximum) 

94 °C 5 min - thorough denaturation 

2 times: 94 °C 1 min 

64 °C 5 min 

7 2°C 2 min 
2 times: 94°C 1 min 

62°C 5 min 

72°C 2 min 
2 times: 94 °C 1 min 

60°C 5 min 

72°C 2 min 



(reduce the annealing temperature 2°C 
until 44°C is reached after that: 



every second round) , 



40 times: 94°C 20 sec 

48 D C 1 min 
72°C 2 min 

15 

finally, let cool down to 15°C. 



An SCR or SCL gene coding sequence also may be 
isolated by screening a plant genomic or cDNA library using 
an SCR or SCL nucleotide sequence (e.g. , the sequence of any 
of the SCR or SCL genes and sequences and EST clone sequences 
listed in Table 1 and Table 2.) as a hybridization probe. 
For example, the whole, or a segment, of the Arabidopsis SCR 
nucleotide sequence (FIG. 5A) may be used. Alternatively, a 
SCR or SCL gene may be isolated from such libraries using a 
degenerate oligonucleotide that corresponds to a segment of a 
SCR amino acid sequence as a probe. For example, a 
degenerate oligonucleotide probe corresponding to a segment 
of the Arabidopsis SCR amino acid sequence (FIG. 5E) may be 
used . 

In preparation of cDNA libraries, total RNA is 
isolated from plant tissues, preferably roots. Poly (A) + RNA 
is isolated from the total RNA, and cDNA prepared from the 
poly (A) + RNA, all using standard procedures. See, for 
example, Sambrook et al . , Molecular Cloning: A Laboratory 
Manual, 2d ed . , Vol. 2 (1989) . The cDNAs may be synthesized 
with a restriction enzyme site at their 3 ' -ends by using an 
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appropriate primer and further have linkers or adaptors 
attached at their 5 ' -ends to facilitate the insertion of the 
cDNAs into suitable cDNA cloning vectors. Alternatively, 
adaptors or linkers may be attached to the cDNAs after the 
5 completion of cDNA synthesis. 

In preparation of genomic libraries, plant DNA is 
isolated and fragments are generated, some of which will 
encode parts of the whole SCR protein. The DNA may be 
cleaved at specific sites using various restriction enzymes. 

10 Alternatively, one may use DNase in the presence of manganese 
to fragment the DNA, or the DNA can be physically sheared, as 
for example, by sonication. The DNA fragments can then be 
separated according to size by standard techniques, 
including, but not limited to, agarose and polyacrylamide gel 

15 electrophoresis, column chromatography and sucrose gradient 
centrif ugation . 

The genomic DNA or cDNA fragments can be inserted 
into suitable vectors, including, but not limited to, 
plasmids, cosmids, bacteriophages lambda or T 4 , and yeast 

20 artificial chromosome (YAC) [See, for example, Sambrook et 
al . , Molecular Cloning: A Laboratory Manual , 2d ed. , Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York 
(1989),- Glover, D.M(ed.), DNA Cloning: A Practical Approach , 
MRL Press, Ltd., Oxford, U.K., Vols. I and II (1985)]. 

25 The SCR or SCL nucleotide probe, DNA or RNA, should 

be at least 17 nucleotides, preferably at least 26 
nucleotides, and most preferably at least 50 nucleotides in 
length. The nucleotide probe is hybridized under moderate 
stringency conditions and washed either under moderate, or 

30 preferably under high stringency conditions. Clones in 

libraries with insert DNA having substantial homology to the 
SCR or SCL probe will hybridize to the probe. Hybridization 
of the nucleotide probe to genomic or cDNA libraries is 
carried out using methods known in the art. One of ordinary 

35 skill in the art will know that the appropriate hybridization 
and wash conditions depend on the length and base composition 
of the probe and that such conditions may be determined using 
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standard formulae. See, for example, Sambrook et al . , 
Molecular Cloning: A Laboratory Manual , 2nd ed., Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY, Vol. 2, 
(1989) pp 11.45-11.57 and 15.55-15.57. 
5 The identity of a cloned or amplified SCR gene 

sequence can be verified by comparing the amino acid 
sequences of its three open reading frames with the amino 
acid sequence of a SCR gene (e.g., Arabidopsis SCR protein 
[SEQ ID No: 2]) . A SCR gene or coding sequence encodes a 

10 protein or polypeptide whose amino acid sequence is 

substantially similar to that of a SCR protein or polypeptide 
(e.g., the amino acid sequence of any one of the SCR proteins 
and/or polypeptides shown in FIG. 5A, 5E, FIG. 8, FIG. 9, . 
FIGS. 11A-B, FIGS . 15A-S, FIG. 17B, FIG. 18 and FIG. 25). 

15 The identity of the cloned or amplified SCR gene sequence may 
be further verified by examining its expression pattern, 
which should show highly localized expression in the embryo 
and/or root of the plant from which the SCR gene sequence was 
isolated. 

20 Comparison of the amino acid sequences encoded by a 

cloned or amplified sequence may reveal that it does not 
contain the entire SCR gene or its promoter. In such a case, 
the cloned or amplified SCR gene sequence may be used as a 
probe to screen a genomic library for clones having inserts 

25 that overlap the cloned or amplified SCR gene sequence. A 
complete SCR gene and its promoter may be reconstructed by 
splicing the overlapping SCR gene sequences. 

5.1.2. EXPRESSION OF SCR GENE PRODUCTS 
30 SCR proteins, polypeptides and peptide fragments, 

mutated, truncated or deleted forms of SCR and/or SCR fusion 
proteins can be prepared for a variety of uses, including, 
but not limited to, the generation of antibodies, as reagents 
in assays, the identification of other cellular gene products 
35 involved in regulation of root development; etc. 

SCR translational products include, but are not 
limited to, those proteins and polypeptides encoded by the 
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SCR gene sequences described in Section 5.1, above. The 
invention encompasses proteins that are functionally 
equivalent to the SCR gene products described in Section 5.1. 
Such a SCR gene product may contain one or more deletions, 
5 additions or substitutions of SCR amino acid residues within 
the amino acid sequence encoded by any one of the SCR gene 
sequences described, above, in Section 5.1, but which result 
in a silent change, thus producing a functionally equivalent 
SCR gene product. Amino acid substitutions may be made on 

10 the basis of similarity in polarity, charge, solubility, 

hydrophobic! ty, hydrophilicity and/or the amphipathic nature 
of the residues involved. 

For example, nonpolar (hydrophobic) amino acids 
include alanine, leucine, isoleucine, valine, proline, 

15 phenylalanine, tryptophan and methionine; polar neutral amino 
acids include glycine, serine, threonine, cysteine, tyrosine, 
asparagine and glutamine; positively charged (basic) amino 
acids include arginine, lysine and histidine; and negatively 
charged (acidic) amino acids include aspartic acid and 

20 glutamic acid. "Functionally equivalent", as utilized 
herein, refers to a protein capable of exhibiting a 
substantially similar in vivo activity as the endogenous SCR 
gene products encoded by the SCR gene sequences described in 
Section 5.1, above. Alternatively, "functionally equivalent" 

25 may refer to peptides capable of regulating gene expression 
in a manner substantially similar to the way in which the 
corresponding portion of the endogenous SCR gene product 
would . 

The invention also encompasses mutant SCR proteins 
30 and polypeptides that are not functionally equivalent to the 
gene products described in Section 5.1. Such a mutant SCR 
protein or polypeptide may contain one or more deletions, 
additions or substitutions of SCR amino acid residues within 
the amino acid sequence encoded by any one the SCR gene 
35 sequences described above' in Section 5.1., and which result 
in loss of one or more functions of the SCR protein (e.g., 
recognition of a specific nucleic sequence, binding of a 
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transcription factor, etc.), thus producing a SCR gene 
product not functionally equivalent to the wild-type SCR 
protein. 

While random mutations can be made to SCR DNA 
5 (using random mutagenesis techniques well known to those 
skilled in the art) and the resulting mutant SCRs tested for 
activity, site-directed mutations of the SCR gene and/or 
coding sequence can be engineered (using site-directed 
mutagenesis techniques well known to those skilled in the 

10 art) to generate mutant SCRs with increased function, (e.g., 
resulting in improved root formation) , or decreased function 
(e.g., resulting in suboptimal root function). In 
particular, mutated SCR proteins in which any of the domains 
shown in FIGS. 13A-F are deleted or mutated are within the 

15 scope of the invention. Additionally, peptides corresponding 
to one or more domains of the SCR (e.g., shown in FIGS. 13 A- 
F) , truncated or deleted SCRs, as well as fusion proteins in 
which the full length SCR, a SCR polypeptide or peptide fused 
to an unrelated protein are also within the scope of the 

20 invention and can be designed on the basis of the SCR 

nucleotide and SCR amino acid sequences disclosed in Section 
5.1. above . 

While the SCR polypeptides and peptides can be 
chemically synthesized (e.g., see Creighton, 1983, Proteins: 

25 Structures and Molecular Principles, W.H. Freeman & Co., 

N.Y.) large polypeptides derived from SCR and the full length 
SCR may advantageously be produced by recombinant DNA 
technology using techniques well known to those skilled in 
the art for expressing nucleic acid sequences. 

30 Methods which are well known to those skilled in 

the art can be used to construct expression vectors 
containing SCR protein coding sequences and appropriate 
transcriptional/translational control signals. These methods 
include, for example, in vitro recombinant DNA techniques, 

35 synthetic techniques and in vivo recombination/genetic 

recombination. See, for example, the techniques described in 
Sambrook et al . , 1989, supra, and Ausubel et al . , 1989, 
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supra. Alternatively, RNA capable of encoding SCR protein 
sequences may be chemically synthesized using, for example, 
synthesizers. See, for example, the techniques described in 
"Oligonucleotide Synthesis", 1984, Gait, M.J. ed. , IRL Press, 
5 Oxford. 

A variety of host-expression vector systems may be 
utilized to express the SCR gene products of the invention. 
Such host-expression systems represent vehicles by which the 
SCR gene products of interest may be produced and 

10 subsequently recovered and/or purified from the culture or 
plant (using purification methods well known to those skilled 
in the art) , but also represent cells which may, when 
transformed or transfected with the appropriate nucleotide' 
coding sequences, exhibit the SCR protein of the invention in 

15 situ. These include, but are not limited to, microorganisms 
such as bacteria (e.g., E . coli, B. subtilis) transformed 
with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors containing SCR protein coding sequences; 
yeast {e.g., Saccharomyces , Pichia) transformed with 

2 0 recombinant yeast expression vectors containing the SCR 

protein coding sequences; insect cell systems infected with 
recombinant virus expression vectors (e.g., baculovirus) 
containing the SCR protein coding sequences; plant cell 
systems infected with recombinant virus expression vectors 

25 (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, 
TMV) or transformed with recombinant plasmid expression 
vectors (e.g., Ti plasmid) containing SCR protein coding 
sequences; or mammalian cell systems (e.g., COS, CHO, BHK, 
293, 3T3) harboring recombinant expression constructs 

30 containing promoters derived from the genome of mammalian 
cells (e.g., metallothionein promoter) or from mammalian 
viruses (e.g., the adenovirus late promoter; the vaccinia 
virus 7.5K promoter; the cytomegalovirus promoter/enhancer; 
etc . ) . 

35 In bacterial systems, a number of expression 

vectors may be advantageously selected depending upon the use 
intended for the SCR protein being expressed. For example, 
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when a large quantity of such a protein is to be produced, 
for the generation of antibodies or to screen peptide 
libraries, for example, vectors which direct the expression 
of high levels of fusion protein products that are readily 
5 purified may be desirable. Such vectors include, but are not 
limited, to the E. coli expression vector pUR278 (Ruther et 
al . , 1983, EMBO J. 2:1791), in which the SCR coding sequence 
may be ligated individually into the vector in frame with the 
lac Z coding region so that a fusion protein is produced; pIN 

10 vectors (Inouye & Inouye, 1985, Nucleic Acids Res. 13:3101- 
3109; Van Heeke & Schuster, 1989, J. Biol. Chem. 264:5503- 
5509); and the like. pGEX vectors may also be used to 
express foreign polypeptides as fusion proteins with gluta- 
thione S-transf erase (GST). In general, such fusion proteins 

15 are soluble and can be easily purified from lysed cells by 
adsorption to glutathione-agarose beads followed by elution 
in the presence of free glutathione. The pGEX vectors are 
designed to include thrombin or factor Xa protease cleavage 
sites so that the cloned target gene protein can be released 

20 from the GST moiety. 

In one such embodiment of a bacterial system, full 
length cDNA sequences are appended with in- frame Bam HI sites 
at the amino terminus and Eco RI sites at the carboxyl 
terminus using standard PCR methodologies (Innis et al . , 

25 1990, supra) and ligated into the pGEX-2TK vector (Pharmacia, 
Uppsala, Sweden) . The resulting cDNA construct contains a 
kinase recognition site at the amino terminus for radioactive 
labelling and glutathione S-transf erase sequences at the 
carboxyl terminus for affinity purification (Nilsson, et al . , 

30 1985, EMBO J. 4: 1075; Zabeau and Stanley, 1982, EMBO J. 1: 
1217) . 

The recombinant constructs of the present invention 
may include a selectable marker for propagation of the 
construct. For example, a construct to be propagated in 
35 bacteria preferably contains an antibiotic resistance gene, 
such as one that confers resistance to kanamycin, 
tetracycline, streptomycin or chloramphenicol. Suitable 
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vectors for propagating the construct include plasmids, 
cosmids, bacteriophages or viruses, to name but a few. 

In addition, the recombinant constructs may include 
plant-expressible, selectable or screenable marker genes for 
5 isolating, identifying or tracking plant cells transformed by 
these constructs. Selectable markers include, but are not 
limited to, genes that confer antibiotic resistance, (e.g. , 
resistance to kanamycin or hygromycin) or herbicide 
resistance (e.g., resistance to sulfonylurea, 

10 phosphinothricin or glyphosate) . Screenable markers include, 
but are not be limited to, genes encoding ^-glucuronidase 
(Jefferson, 1987, Plant Mol . Biol. Rep. 5:387-405), 
luciferase (Ow et al . , 1986, Science 234:856-859) and B 
protein that regulates anthocyanin pigment production (Goff 

15 et al., 1990, EMBO J 9:2517-2522). 

In embodiments of the present invention which 
utilize the Agrobacterium tumefacien system for transforming 
plants (see infra) , the recombinant constructs may 
additionally comprise at least the right T-DNA border 

20 sequences flanking the DNA sequences to be transformed into 
the plant cell. Alternatively, the recombinant constructs 
may comprise the right and left T-DNA border sequences 
flanking the DNA sequence. The proper design and 
construction of such T-DNA based transformation vectors are 

25 well known to those skilled in the art. 

5.1.3. ANTIBODIES TO SCR PROTEINS AND POLYPEPTIDES 
Antibodies that specifically recognize one or more 
epitopes of SCR, or epitopes of conserved variants of SCR, or 

3 0 peptide fragments of the SCR are also encompassed by the 

invention. Such antibodies include, but are not limited to, 
polyclonal antibodies, monoclonal antibodies (mAbs) , 
humanized or chimeric antibodies, single chain antibodies, 
Fab fragments, F(ab');, fragments, fragments produced by a Fab 

35 expression library, anti-idiotypic (anti-Id) antibodies and 
epitope-binding fragments of any of the above. 
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For the production of antibodies, various host 
animals may be immunized by injection with the SCR protein, 
an SCR peptide (e.g., one corresponding to a functional 
domain of the protein) , a truncated SCR polypeptide (SCR in 
5 which one or more domains has been deleted) , functional 

equivalents of the SCR protein or mutants of the SCR protein. 
Such SCR proteins, polypeptides, peptides or fusion proteins 
can be prepared and obtained as described in Section 5.1.2. 
supra. Host animals may include, but are not limited to, 

10 rabbits, mice and rats, to name but a few. Various adjuvants 
may be used to increase the immunological response, depending 
on the host species, including, but not limited to, Freund ' s 
(complete and incomplete) , mineral gels such as aluminum . 
hydroxide, surface active substances such as lysolecithin, 

15 pluronic polyols, polyanions, peptides, oil emulsions, 
keyhole limpet hemocyanin, dinitrophenol and potentially 
useful human adjuvants such as BCG (bacille Calmette-Guerin) 
and Corynebacterium parvum. Polyclonal antibodies are 
heterogeneous populations of antibody molecules derived from 

20 the sera of the immunized animals. 

Monoclonal antibodies, which are homogeneous 
populations of antibodies to a particular antigen, may be 
obtained by any technique which provides for the production 
of antibody molecules by continuous cell lines in culture. 

25 These include, but are not limited to, the hybridoma 

technique of Kohler and Milstein, (Nature 256:495-497 [1975]; 
and U.S. Patent No. 4,376,110), the human B-cell hybridoma 
technique (Kosbor et al . , 1983, Immunology Today 4:72; Cole 
et al . , 1983, Proc . Natl. Acad. Sci . USA 80:2026-2030) and 

30 the EBV-hybridoma technique (Cole et al . , 1985, Monoclonal 
Antibodies And Cancer Therapy, Alan R. Liss, Inc., pp. 77- 
96) . Such antibodies may be of any immunoglobulin class 
including IgG, IgM, IgE, IgA, IgD and any subclass thereof. 
.The hybridoma producing the mAb of this invention may be 

35 cultivated in vitro or in vivo. Production of high titers of 
mAbs in vivo makes this the presently preferred method of 
production . 
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In addition, techniques developed for the 
production of "chimeric antibodies" (Morrison et al . , 1984, 
Proc. Natl. Acad. Sci., 81:6851-6855; Neuberger et al . , 1984, 
Nature, 312:604-608; Takeda et al . , 1985, Nature, 314:452- 
5 454) by splicing the genes from a mouse antibody molecule of 
appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity 
can be used. A chimeric antibody is a molecule in which 
different portions are derived from different animal species, 

10 such as those having a variable region derived from a murine 
mAb and a human immunoglobulin constant region. 

In addition, techniques have been developed for the 
production of humanized antibodies. (See, e.g., Queen, U.S. 
Patent No. 5,585,089.) An immunoglobulin light or heavy 

15 chain variable region consists of a "framework" region 

interrupted by three hypervariable regions, referred to as 
complementarity determining regions (CDRs) . The extent of 
the framework region and CDRs have been precisely defined 
(see, "Sequences of Proteins of Immunological Interest", 

2 0 Kabat, E. et al . , U.S. Department of Health and Human 

Services (1983)). Briefly, humanized antibodies are antibody 
molecules from non-human species having one or more CDRs from 
the non-human species and a framework region from a human 
immunoglobulin molecule. 

25 Alternatively, techniques described for the 

production of single chain antibodies (U.S. Patent 4,946,778; 
Bird, 1988, Science 242:423-426; Huston et al . , 1988, Proc. 
Natl. Acad. Sci. USA 85:5879-5883; and Ward et al., 1989, 
Nature 334:544-546) can be adapted to produce single chain 

30 antibodies against SCR proteins or polypeptides. Single 
chain antibodies are formed by linking the heavy and light 
chain fragments of the Fv region via an amino acid bridge, 
resulting in a single chain polypeptide. 

Antibody fragments which recognize specific 

35 epitopes may be generated by known techniques. For example, 
such fragments include, but are not limited to: the F(ab') 2 
fragments which can be produced by pepsin digestion of the 
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antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of the F(ab") 2 
fragments. Alternatively, Fab expression libraries may be 
constructed (Huse et al . , 1989, Science, 246:1275-1281) to 
5 allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. 

Antibodies to a SCR protein and/or polypeptide can, 
in turn, be utilized to generate anti-idiotype antibodies 
that "mimic" SCR, using techniques well known to those 
10 skilled in the art. (See, e.g., Greenspan & Bona, 1993, 
FASEB J 7 (5) :437-444; and Nissinoff, 1991, J. Immunol. 
147 (8) :2429-2438) . 

5.1.4. SCR GENE OR GENE PRODUCTS AS 

15 MARKERS FOR QUANTITATIVE TRAIT LOCI 

Any of the nucleotide sequences (including EST 
clone sequences) described in §§ 5.1 and 5.1.1. and/or listed 
in Tables 1 or 2, and/or polypeptides and proteins described 
in §§ 5.1.2. and/or listed in Tables 1 or 2 , can be used as 

20 markers for quantitative trait loci in breeding programs for 
crop plants. To this end, the nucleic acid molecules, 
including, but not limited to, full length SCR coding 
sequences, and/or partial sequences (ESTs) , can be used in 
hybridization and/or DNA amplification assays to identify the 

25 endogenous SCR genes, scr mutant alleles and/or SCR 

expression products in cultivars as compared to wild-type 
plants. They can be used also as markers for linkage 
analysis of quantitative trait loci. It is possible also 
that the SCR gene may encode a product responsible for a 

30 qualitative trait that is desirable in a crop breeding 

program. Alternatively, the SCR protein, peptides and/or 
antibodies can be used as reagents in immunoassays to detect 
expression of the SCR gene in cultivars and wild- type plants. 
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5.1.5. SCR-LIKE GENES 

Scarecrow- like (SCL) genes are genes which show a 
high degree of similarity to the SCR gene. Tables 1 and 2 
show a list of various SCL genes which were recently- 
identified. Tables 1 and 2 also show each EST clone and/or 
genomic sequence corresponding with each of the SCL genes. 
The partial nucleotide sequence of various Arabidopsis EST's 
that encode members of the SCL gene family are shown in FIGS. 
28A-AH. 

Sequence analysis of the genes showed that a 
variable amino- terminal (N- terminal) and a highly conserved 
carboxyl- termini (C- termini) region exist throughout these 
putative gene products. The highly conserved region does not 
show significant similarity to members of any recognized gene 
family, indicating that these sequences likely define a novel 
gene family. Based on the high degree of similarity of the 
gene products to SCR, the genes corresponding to these ESTs 
were designated SCARECROW- LIKE (SCL) . Recently, the 
importance of this gene family has been confirmed. Two 
components of the gibberellin signal transduction pathway, 
the gene products of the GIBBERELLIN -ACID INSENSITIVE (GAI) 
and the REPRESSOR OF GAI (RGA) loci, have been shown to be 
members of this family (Peng et al . , 1997, Genes & Dev. 11, 
3194-3205; Silverstone et al . , 1998, Plant Cell 10, 155-169). 
Thus, this family of gene products has been designated as the 
GRAS gene family, an acronym based on the designations of the 
known genes: GAI, RGA and SCR. An alignment of various GRAS 
gene products is shown at FIG. 29. As shown on the figure, 
the gene products have at least five recognizable motifs that 
are highly conserved. The absolutely conserved residues 
within the VHIID and SAW motifs are highlighted in bold, as 
are the hydrophobic residues of the leucine heptads, the 
P-F-Y-R-E residues of the PFYRE motif, and the two short 
sequences that define the end of the VHIID motif and the 
beginning of the PFYRE motif. 
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The GRAS family includes at present nineteen 
distinct members in Arabidopsis: fifteen SCLs, SCR, GAI , RGA, 
and RGAL (a GRAS sequence of unknown function with high 
similarity to GAI and RGA) . The fact that the SCR, GAI, and 
5 RGA gene products have diverse roles in fundamental processes 
in plant biology (SCR in pattern formation and GAI /RGA in 
signal transduction) suggests that other members of this 
family may also play important roles in the physiology and 
development of higher plants. Intriguingly, the majority of 

10 the SCL genes are expressed predominantly in the root. 

FIG. 30 and Table 3. Furthermore, one of these (SCL3) has an 
expression pattern in the root that is similar to that of 
SCR. FIG. 31. In addition to root, many of the SCL genes 
are expressed in siliques and shoots. See, Table 3. 

15 The SCL genes and gene products may be isolated and 

expressed with methods similar to those discussed for SCR 
genes at Sections 5.1.1. and 5.1.2., supra. Furthermore, 
antibodies to SCL proteins and polypeptides may be produced 
as was discussed in Section 5.1.3., supra. Finally, SCL 

2 0 genes and gene products may be used as markers for 

quantitative trait loci as was discussed at Section 5.1.4., 
supra . 
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5.2. SCR PROMOTERS 

According to the present invention, SCR promoters 
and functional portions thereof described herein refer to 
regions of the SCR gene which are capable of promoting 
5 tissue- specif ic expression in embryos, roots and shoots of an 
operably linked coding sequence in plants. The SCR promoter 
described herein refers to the regulatory elements of SCR 
genes, i.e., regulatory regions of genes which are capable of 
selectively hybridizing to the nucleic acids described in 

10 Section 5.1, or regulatory sequences contained, for example, 
in the region between the translational start site of the 
Arabidopsis SCR gene and the Hindi I I site approximately 2.5 
kb upstream of the site in plasmid pLIGl -3/SAC+Mob21SAC (see 
FIGS. 5A and 14) in hybridization assays, or which are 

15 homologous by sequence analysis (containing a span of 10 or 
more nucleotides in which at least 50 percent of the 
nucleotides are identical to the sequences presented herein) . 
Homologous nucleotide sequences refer to nucleotide sequences 
including, but not limited to, SCR promoters in diverse plant 

20 species (e.g., promoters of orthologs of Arabidopsis SCR) as 
well as genetically engineered derivatives of the promoters 
described herein. 

Methods which could be used for the synthesis, 
isolation, molecular cloning, characterization and 

25 manipulation of SCR promoter sequences are well known to 
those skilled in the art. See, e.g., the techniques 
described in Sambrook et al . , Molecular Cloning: A 
Laboratory Manual, 2nd. ed., Cold Spring Harbor Laboratory, 
Cold Spring Harbor, New York (1989) . 

30 According to the present invention, SCR promoter 

sequences or portions thereof described herein may be 
obtained from appropriate plant or mammalian sources from 
cell lines or recombinant DNA constructs containing SCR 
promoter sequences, and/or by chemical synthetic methods. 

35 SCR promoter sequences can be obtained from genomic clones 
containing sequences 5 1 upstream of SCR coding sequences . 
Such 5' upstream clones may be obtained by screening genomic 
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libraries using SCR protein coding sequences, particularly 
those encoding SCR N-terminal sequences, from SCR gene clones 
obtained as described in Sections 5.1. and 5.2. Standard 
methods that may used in such screening include, for example, 
5 the method set forth in Benton & Davis, 1977, Science 196:180 
for bacteriophage libraries; and Grunstein & Hogness, 1975, 
Proc. Nat. Acad. Sci. U.S.A. 72:3961-3965 for plasmid 
libraries . 

The full extent and location of SCR promoters 

10 within such 5' upstream clones may be determined by the 

functional assay described below. In the event a 5' upstream 
clone does not contain the entire SCR promoter as determined 
by the functional assay, the insert DNA of the clone may be 
used to isolate genomic clones containing sequences further 

15 5 ' upstream of the SCR coding sequences . Such further 

upstream sequences can be spliced on to existing 5 1 upstream 
sequences and the reconstructed 5" upstream region tested for 
functionality as a SCR promoter (i.e., promoting tissue- 
specific expression in embryos and/or roots of an operably 

20 linked gene in plants) . This process may be repeated until 
the complete SCR promoter is obtained. 

The location of the SCR promoter within genomic 
sequences 5' upstream of the SCR gene isolated as described 
above may be determined using any method known in the art. 

25 For example, the 3' end of the promoter may be identified by 
locating the transcription initiation site, which may be 
determined by methods such as RNase protection (e.g., Liang 
et al., 1989, J. Biol. Chem. 264:14486-14498), primer 
extension (e.g., Weissenborn & Larson, 1992, J. Biol. Chem. 

30 267:6122-6131) and/or reverse transcriptase/PCR . The 

location of the 3 ' end of the promoter may be confirmed by 
sequencing and computer analysis, examining for the canonical 
AGGA or TATA boxes of promoters that are typically 50-60 base 
pairs (bp) and 25-35 bp, respectively, 5' upstream of the 

35 transcription initiation site. The 5' end promoter may be 

defined by deleting sequences from the 5' end of the promoter 
containing fragment, constructing a transcriptional or 
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translat ional fusion of the resected fragment and a reporter 
gene and examining the expression characteristics of the 
chimeric gene in transgenic plants. Reporter genes that may 
be used to such ends include, but are not limited to, GUS, 
5 CAT, luciferase, p-galactosidase and CI and R gene 
controlling anthocyanin production. 

According to the present invention, a SCR promoter 
is one that confers to an operably linked gene in a 
transgenic plant tissue-specific expression in roots, root 

10 nodules, stems and/or embryos. A SCR promoter comprises the 
region between about -5,000 bp and +1 bp upstream of the 
transcription initiation site of a SCR gene. In a particular 
embodiment, the Arabidopsis SCR promoter comprises the region 
between positions -2.5 kb and +1 in the 5' upstream region of 

15 the Arabidopsis SCR gene (see FIGS. 5A and 14) . 

5.2.1. CIS -REGULATORY ELEMENTS OF SCR PROMOTERS 
According to the present invention, the cis- 
regulatory elements within a SCR promoter may be identified 

20 using any method known in the art. For example, the location 
of cis-regulatory elements within an inducible promoter may 
be identified using methods such as DNase or chemical 
footprinting {e.g., Meier et al . , 1991, Plant Cell 3:309-315) 
or gel retardation (e.g., Weissenborn & Larson, 1992, J. 

25 Biol. Chem. 267-6122-6131; Beato, 1989, Cell 56:335-344; 
Johnson et al . , 1989, Ann. Rev. Biochem. 58:799-839). 
Additionally, resectioning experiments also may be employed 
to define the location of the cis-regulatory elements. For 
example, an inducible promoter-containing fragment may be 

30 resected from either the 5' or 3 ' end using restriction 
enzyme or exonuclease digests. 

To determine the location of cis-regulatory 
elements within the sequence containing the inducible 
.promoter, the 5* or 3 ' resected fragments, internal fragments 

35 to the inducible promoter containing sequence or inducible 
promoter fragments containing sequences identified by 
footprinting or gel retardation experiments may be fused to 
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the 5' end of a truncated plant promoter, and the activity of 
the chimeric promoter in transgenic plant examined. Useful 
truncated promoters to these ends comprise sequences starting 
at or about the transcription initiation site and extending 
5 to no more than 150 bp 5' upstream. These truncated 
promoters generally are inactive or are only minimally 
active. Examples of such truncated plant promoters may 
include, among others, a "minimal" CaMV 3 5S promoter whose 5' 
end terminates at position -46 bp with respect to the 

10 transcription initiation site (Skriver et al . , Proc . Natl. 
Acad. Sci. USA 88:7266-7270); the truncated "-90 35S" 
promoter in the X-GUS-90 vector (Benfey & Chua, 1989, Science 
244:174-181); a truncated "-101 nos" promoter derived from 
the nopaline synthase promoter (Aryan et al . , 1991, Mol . Gen. 

15 Genet. 225:65-71); and the truncated maize Adh-1 promoter in 
pADcat 2 (Ellis et al . , 1987, EMBO J. 6:11-16). 

According to the present invention, a cis- 
regulatory element of a SCR promoter is a sequence that 
confers to a truncated promoter tissue- specif ic expression in 

2 0 embryos, stems, root nodules and/or roots. 

5.2.2. SCR PROMOTER -DRIVEN EXPRESSION VECTORS 
The properties of the nucleic acid sequences are 
varied as are the genetic structures of various potential 

25 host plant cells. In the preferred embodiments of the 
present invention, described herein, a number of features 
which an artisan may recognize as not being absolutely 
essential, but clearly advantageous are used. These include 
methods of isolation, synthesis or construction of gene 

30 constructs, the manipulation of the gene constructs to be 
introduced into plant cells, certain features of the gene 
constructs, and certain features of the vectors associated 
with the gene constructs. 

Further, the gene constructs of the present 

35 invention may be encoded on DNA or RNA molecules. According 
to the present invention, it is preferred that the desired, 
stable genotypic change of the target plant be effected 
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through genomic integration of exogenously introduced nucleic 
acid construct (s) , particularly recombinant DNA constructs. 
Nonetheless, according to the present invention, such 
genotypic changes also can be effected by the introduction of 
5 episomes (DNA or RNA) that can replicate autonomously and 
that are somatically and germinally stable. Where the 
introduced nucleic acid constructs comprise RNA, plant 
transformation or gene expression from such constructs may 
proceed through a DNA intermediate produced by reverse 

10 transcription. 

The present invention provides for use of 
recombinant DNA constructs which contain tissue-specific and 
developmental -specif ic promoter fragments and functional , 
portions thereof. As used herein, a functional portion of a 

15 SCR promoter is capable of functioning as a tissue-specific 
promoter in the embryo, stem, root nodule and/or root of a 
plant. The functionality of such sequences can be readily 
established by any method known in the art. Such methods 
include, for example, constructing expression vectors with 

20 such sequences and determining whether they confer tissue- 
specific expression in the embryo, stem, root nodule and/or 
root to an operably linked gene. In a particular embodiment, 
the invention provides for the use of the Arabidopsis SCR 
promoter contained in the sequences depicted in FIGS. 5A and 

25 14 and the insert DNA of plasmid pGEX-2TK + . 

The SCR promoters of the invention may be used to 
direct the expression of any desired protein, or to direct 
the expression of a RNA product, including, but not limited 
to, an "antisense" RNA or ribozyme. Such recombinant 

3 0 constructs generally comprise a native SCR promoter or a 
recombinant SCR promoter derived therefrom, ligated to the 
nucleic acid sequence encoding a desired heterologous gene 
product . 

A recombinant SCR promoter is used herein to refer 
35 to a promoter that comprises a functional portion of a native 
SCR promoter or a promoter that contains native promoter 
sequences that is modified by a regulatory element from a SCR 
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promoter. Alternatively, a recombinant inducible promoter 
derived from the SCR promoter may be a chimeric promoter, 
comprising a full-length or truncated plant promoter modified 
by the attachment of one or more SCR cis- regulatory elements. 
5 The manner of chimeric promoter constructions may 

be any well known in the art. For examples of approaches 
that can be used in such constructions, see Section 5.1.2., 
above and Fluhr et al . , 1986, Science 232:1105-1112; Ellis et 
al., 1987, EMBO J. 6:11-16; Strittmatter & Chua, 1987, Proc. 

10 Natl. Acad. Sci . USA 84:8986-8990; Poulsen & Chua, 1988, Mol . 
Gen. Genet. 214:16-23; Comai et al . , 1991, Plant Mol. Biol. 
15:373-381; Aryan et al . , 1991, Mol. Gen. Genet. 225:65-71. 

According to the present invention, where a SCR 
promoter or a recombinant SCR promoter is used to express a 

15 desired protein, the DNA construct is designed so that the 
protein coding sequence is ligated in phase with the 
translational initiation codon downstream of the promoter. 
Where the promoter fragment is missing 5' leader sequences, a 
DNA fragment encoding both the protein and its 5' RNA leader 

20 sequence is ligated immediately downstream of the 

transcription initiation site. Alternatively, an unrelated 
5 ' RNA leader sequence may be used to bridge the promoter and 
the protein coding sequence. In such instances, the design 
should be such that the protein coding sequence is ligated in 

25 phase with the initiation codon present in the leader 
sequence, or ligated such that no initiation codon is 
interposed between the transcription initiation site and the 
first methionine codon of the protein. 

Further, it may be desirable to include additional 

30 DNA sequences in the protein expression constructs. Examples 
of additional DNA sequences include, but are not limited to, 
those encoding: a 3' untranslated region; a transcription 
termination and polyadenylation signal; an intron; a signal 
. peptide (which facilitates the secretion of the protein) ; or 

35 a transit peptide (which targets the protein to a particular 
cellular compartment such as the nucleus, chloroplast, 
mitochondria or vacuole) . 

- 56 - 



WO 00/53723 



PCT/USOO/05875 



5.3. PRODUCTION OF TRANSGENIC PLANTS AND PLANT 
CELLS 

According to the present invention, a desirable 

plant or plant cell may be obtained by transforming a plant 

2 cell with the nucleic acid constructs described herein. In 

some instances, it may be desirable to engineer a plant or 

plant cell with several different gene constructs. Such 

engineering may be accomplished by transforming a plant or 

plant cell with all of the desired gene constructs 

10 simultaneously. Alternatively, the engineering may be 
carried out sequentially. That is, transforming with one 
gene construct, obtaining the desired transformant after 
selection and screening, transforming the transformant with a 
second gene construct, and so on. 

^5 In an embodiment of the present invention, 

Agrobacterium is employed to introduce the gene constructs 
into plants. Such transformations preferably use binary 
Agrobacterium T-DNA vectors (Bevan, 1984, Nuc . Acid Res. 
12:8711-8721) and the co-cultivation procedure (Horsch et 

2Q al., 1985, Science 227:1229-1231). Generally, the 

Agrobacterium transformation system is used to engineer 
dicotyledonous plants (Bevan et al . , 1982, Ann. Rev. Genet. 
16:357-384; Rogers et al . , 1986, Methods Enzymol . 118:627- 
641) . The Agrobacterium transformation system also may be 

2 g used to transform, as well as transfer, DNA to 

monocotyledonous plants and plant cells (see Hernalsteen et 
al., 1984, EMBO J 3:3039-3041; Hooykass-Van Slogteren et al . , 
1984, Nature 311:763-764; Grimsley et al . , 1987, Nature 
325:1677-179; Boulton et al . , 1989, Plant Mol . Biol. 12:31- 

3Q 40.; Gould et al., 1991, Plant Physiol. 95:426-434). 

In other embodiments, various alternative methods 
for introducing recombinant nucleic acid constructs into 
plants and plant cells also may be utilized. These other 
methods are particularly useful where the target is a 

2 5 monocotyledonous plant or plant cell. Alternative gene 
transfer and transformation methods include, but are not 
limited to, protoplast transformation through calcium-, 

- 57 - 



WO 00/53723 



PCT/US00/05875 



polyethylene glycol (PEG) , electroporation-mediated uptake of 
naked DNA (see Paszkowski et al . , 1984, EMBO J 3:2717-2722, 
Potrykus et al . , 1985, Mol . Gen. Genet. 199:169-177; Fromm et 
al., 1985, Proc. Natl. Acad. Sci . USA 82:5824-5828; 
5 Shimamoto, 1989, Nature 338:274-276) and electroporation of 
plant tissues (D'Halluin et al . , 1992, Plant Cell 4:1495- 
1505) . Additional methods for plant cell transformation 
include microinjection, silicon carbide mediated DNA uptake 
(Kaeppler et al . , 1990, Plant Cell Reporter 9:415-418) and 

10 microprojectile bombardment (see Klein et al . , 1988, Proc. 
Natl. Acad. Sci. USA 85:4305-4309; Gordon-Kamm et al . , 1990, 
Plant Cell 2 :603-618) . 

According to the present invention, a wide variety 
of plants may be engineered for the desired physiological and 

15 agronomic characteristics described herein using the nucleic 
acid constructs of the instant invention and the various 
transformation methods mentioned above. In preferred 
embodiments, target plants for engineering include, but are 
not limited to, crop plants such as maize, wheat, rice, 

20 soybean, tomato, tobacco, carrots, peanut, potato, sugar 
beets, sunflower, yam, Arabidopsis, rape seed and petunia; 
and trees such as spruce . 

According to the present invention, desired plants 
and plant cells may be obtained by engineering the gene 

25 constructs described herein into a variety of plant cell 
types, including, but not limited to, protoplasts, tissue 
culture cells, tissue and organ explants, pollen, embryos as 
well as whole plants. In an embodiment of the present 
invention, the engineered plant material is selected or 

30 screened for transf ormants (i.e., those that have 

incorporated or integrated the introduced gene construct (s) ) 
following the approaches and methods described below. An 
isolated transf ormant may then be regenerated into a plant. 
Alternatively, the engineered plant material may be 

35 regenerated into a plant, or plantlet, before subjecting the 
derived plant, or plantlet, to selection or screening for the 
marker gene traits. Procedures for regenerating plants from 
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plant cells, tissues or organs, either before or after 
selecting or screening for marker gene(s), are well known to 
those skilled in the art. 

A transformed plant cell, callus, tissue or plant 
5 may be identified and isolated by selecting or screening the 
engineered plant material for traits encoded by the marker 
genes present on the transforming DNA. For instance, 
selection may be performed by growing the engineered plant 
material on media containing inhibitory amounts of the 

10 antibiotic or herbicide to which the transforming marker gene 
construct confers resistance. Further, transformed plants 
and plant cells also may be identified by screening for the 
activities of any visible marker genes {e.g., the 6- 
glucuronidase, luciferase, B or CI genes) that may be present 

15 on the recombinant nucleic acid constructs of the present 
invention. Such selection and screening methodologies are 
well known to those skilled in the art. 

Physical and biochemical methods also may be used 
to identify a plant or plant cell transformant containing the 

20 gene constructs of the present invention. These methods 

include, but are not limited to: 1) Southern analysis or PCR 
amplification for detecting and determining the structure of 
the recombinant DNA insert; 2) Northern blot, S-l RNase 
protection, primer-extension or reverse transcriptase-PCR 

25 amplification for detecting and examining RNA transcripts of 
the gene constructs; 3) enzymatic assays for detecting enzyme 
or ribozyme activity, where such gene products are encoded by 
the gene construct; 4) protein gel electrophoresis, western 
blot techniques, immunoprecipitation, or enzyme-linked 

3 0 immunoassays, where the gene construct products are proteins; 
5) biochemical measurements of compounds produced as a 
consequence of the expression of the introduced gene 
constructs. Additional techniques, such as in situ 
hybridization, enzyme staining, and immunostaining also may 

35 be used to detect the presence or expression of the 

recombinant construct in specific plant organs and tissues. 
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The methods for doing all of these assays are well known to 
those skilled in the art. 

5.3.1. TRANSGENIC PLANTS THAT ECTOPI CALLY 
5 EXPRESS SCR 

In accordance with the present invention, a plant 

that expresses a recombinant SCR gene may be engineered by 

transforming a plant cell with a gene construct comprising a 

plant promoter operably associated with a sequence encoding a 

1Q SCR protein or a fragment thereof. (Operably associated is 
used herein to mean that transcription controlled by the 
"associated" promoter would produce a functional messenger 
RNA, whose translation would produce the enzyme.) The plant 
promoter may be constitutive or inducible. Useful 
constitutive promoters include, but are not limited to, the 
CaMV 3 5S promoter, the T-DNA mannopine synthetase promoter 
and their various derivatives. Useful inducible promoters 
include, but are not limited to, the promoters of ribulose 
bisphosphate carboxylase (RUBISCO) genes, chlorophyll a/b 

2q binding protein (CAB) genes, heat shock. genes, the defense 
responsive gene (e.g., phenylalanine ammonia lyase genes), 
wound induced genes {e.g., hydroxyproline rich cell wall 
protein genes), chemically-inducible genes (e.g., nitrate 
reductase genes, gluconase genes, chitinase genes, PR-1 genes 

2^ etc.), dark- inducible genes (e.g., asparagine synthetase gene 
(Coruzzi and Tsai, U.S. Patent 5,256,558, October 26, 1993, 
Gene Encoding Plant Asparagine Synthetase) ) and 
developmentally regulated genes (e.g., Shoot Meristemless 
gene) , to name just a few. 

30 In yet another embodiment of the present invention, 

it may be advantageous to transform a plant with a gene 
construct operably linking a modified or artificial promoter 
to a sequence encoding a SCR protein or a fragment thereof. 
Typically, such promoters, constructed by recombining 

35 structural elements of different promoters, have unique 
expression patterns and/or levels not found in natural 
promoters. See, e.g., Salina et al . , 1992, Plant Cell 
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4:1485-1493, for examples of artificial promoters constructed 
from combining cis-regulatory elements with a promoter core. 

In a preferred embodiment of the present invention, 
the associated promoter is a strong and root, root nodule, 
5 stem and/or embryo -specific plant promoter such that the SCR 
protein is overexpressed in the transgenic plant. Examples 
of root- and root nodules-specific promoters include, but are 
not limited to, the promoters of SCR genes, SHR genes, 
legehemoglobin genes, nodulin genes and root-specific 

10 glutamine synthetase genes (See e.g., Tingey et al . , 1987, 
EMBO J. 6:1-9,- Edwards et al., 1990, Proc . Nat. Acad. Sci . 
USA 87 :3459-3463) . 

In yet another preferred embodiment of the present 
invention, the overexpression of SCR protein in roots may be 

15 engineered by increasing the copy number of the SCR gene. 
One approach to producing such transgenic plants is to 
transform with nucleic acid constructs that contain multiple 
copies of the complete SCR gene (i.e., with its own native 
SCR promoter) . Another approach is to repeatedly transform 

20 successive generations of a plant line with one or more 

copies of the complete SCR gene. Yet another approach is to 
place a complete SCR gene in a nucleic acid construct 
containing an amplification-selectable marker (ASM) gene such 
as the glutamine synthetase or dihydrof olate reductase gene. 

25 Cells transformed with such constructs are subjected to 
culturing regimes that select cell lines with increased 
copies of complete SCR genes. See, e.g., Donn et al . , 1984, 
J. Mol . Appl . Genet. 2:549-562, for a selection protocol used 
to isolate a plant cell line containing amplified copies of 

30 the GS gene. Because the desired gene is closely linked to 
the ASM, cell lines that amplify the ASM gene are likely also 
to have amplified the SCR gene. Cell lines with amplified 
copies of the SCR gene can then be regenerated into 
•transgenic plants. 
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5.3.2. TRANSGENIC PLANTS THAT SUPPRESS 
ENDOGENOUS SCR EXPRESSION 

In accordance with the present invention, a desired 
plant may be engineered by suppressing SCR activity. In one 
5 embodiment, the suppression may be engineered by transforming 
a plant with a gene construct encoding an antisense RNA or 
ribozyrne complementary to a segment, or the whole, of the SCR 
RNA transcript, including the mature target mRNA. In another 
embodiment, SCR gene suppression may be engineered by 

10 transforming a plant cell with a gene construct encoding a 
ribozyrne that cleaves the SCR mRNA transcript. 
Alternatively, the plant can be engineered, e.g., via 
targeted homologous recombination, to inactive or "knock-out" 
expression of the plant's endogenous SCR. 

15 For all of the aforementioned suppression 

constructs, it is preferred that such gene constructs express 
specifically in the root, root nodule, stem and/or embryo 
tissues. Alternatively, it may be preferred to have the 
suppression constructs expressed constitut ively . Thus, 

20 constitutive promoters, such as the nopaline and the CaMV 35S 
promoter, also may be used to express the suppression 
constructs. A most preferred promoter for these suppression 
constructs is a SCR or SHR promoter. 

In accordance with the present invention, desired 

25 plants with suppressed target gene expression may be 
engineered also by transforming a plant cell with a co- 
suppression construct. A co- suppression construct comprises 
a functional promoter operatively associated with a complete 
or partial SCR gene sequence. It is preferred that the 

30 operatively associated promoter be a strong, constitutive 

promoter, such as the CaMV 35S promoter. Alternatively, the 
co- suppression construct promoter can be one that expresses 
with the same tissue and developmental specificity as the SCR 
gene . 

35 According to the present invention, it is preferred 

that the co-suppression construct encodes an incomplete SCR 
mRNA, although a construct encoding a fully functional SCR 
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mRNA or enzyme also may be useful in effecting co- 
suppression. 

In accordance with the present invention, desired 
plants with suppressed target gene expression also may be 
5 engineered by transforming a plant cell with a construct that 
can effect site-directed mutagenesis of the SCR gene. {See, 
e.g., Offringa et al . , 1990, EMBO J. 9:3077-84; and Kanevskii 
et al., 1990, Dokl . Akad. Nauk. SSSR 312:1505-1507 for 
discussions of nucleic constructs for effecting site-directed 
10 mutagenesis of target genes in plants.) It is preferred that 
such constructs effect suppression of the SCR gene by 
replacing the endogenous SCR gene sequence through homologous 
recombination with either none, or inactive SCR protein 
coding sequences . 

15 

5.3.3. TRANSGENIC PLANTS THAT EXPRESS A 

TRANSGENE CONTROLLED BY THE SCR PROMOTER 

In accordance with the present invention, a desired 
plant may be engineered to express a gene of interest under 

2 0 the control of the SCR promoter. SCR promoters and 

functional portions thereof refer to regions of the nucleic 
acid sequence which are capable of promoting tissue-specific 
transcription of an operably linked gene of interest in the 
embryo, stem, root nodule and/or root of a plant. The SCR 

25 promoter described herein refers to the regulatory elements 
of SCR genes as described in Section 5.2. 

Genes that may be beneficially expressed in the 
roots and/or root nodules of plants include genes involved in 
nitrogen fixation or cytokines or auxins, or genes which 

30 regulate growth, or growth of roots. In addition, genes 
encoding proteins that confer on plants herbicide, salt or 
pest resistance may be engineered for root specific 
expression. The nutritional value of root crops may be 
•enhanced also through SCR promoter driven expression of 

35 nutritional proteins. Alternatively, therapeutically useful 
proteins may be expressed specifically in root crops. 
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Genes that may be beneficially expressed in the 
stems of plants include those involved in starch lignin or 
cellulose biosynthesis. 

In accordance with the present invention, desired 
5 plants which express a heterologous gene of interest under 
the control of the SCR promoter may be engineered by 
transforming a plant cell with SCR promoter driven constructs 
using those techniques described in Section 5.2.2. and 5.3., 
supra . 

10 

5.3.4. SCREENING OF TRANSFORMED PLANTS FOR THOSE 
HAVING. DESIRED ALTERED TRAITS 

It will be recognized by those skilled in the art 

that in order to obtain transgenic plants having the desired 

15 engineered traits, screening of transformed plants (i.e., 

those having an gene construct of the invention) having those 
traits may be required. For example, where the plants have 
been engineered for ectopic overexpression of a SCR gene, 
transformed plants are examined for those expressing the SCR 

2Q gene at the desired level and in the desired tissues and 

developmental stages. Where the plants have been engineered 
for suppression of the SCR gene product, transformed plants 
are examined for those expressing the SCR gene product (e.g., 
RNA or protein) at reduced levels in various tissues. The 

2j plants exhibiting the desired physiological changes, e.g., 
ectopic SCR overexpression or SCR suppression, may then be 
subsequently screened for those plants that have the desired 
structural changes at the plant level (e.g., transgenic 
plants with overexpression or suppression of SCR gene having 

3Q the desired altered root structure) . The same principle 

applies to obtaining transgenic plants having tissue-specific 
expression of a heterologous gene in embryos and/or roots by 
the use of a SCR promoter driven expression construct . 

Alternatively, the transformed plants may be 

35 directly screened for those exhibiting the desired structural 
and functional changes. In one embodiment, such screening 
may be for the size, length or pattern of the root of the 
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transformed plants. In another embodiment, the screening of 
the transformed plants may be for altered gravitropism or 
decreased susceptibility to lodging. In other embodiments, 
the screening of the transformed plants may be for improved 
5 agronomic characteristics (e.g., faster growth, greater 
vegetative or reproductive yields or improved protein 
contents, etc.), as compared to unengineered progenitor 
plants, when cultivated under various growth conditions 
(e.g., soils or media containing different amounts of 

10 nutrients and water content) . 

According to the present invention, plants 
engineered with SCR overexpression may exhibit improved 
vigorous growth characteristics when cultivated under 
conditions where large and thicker roots are advantageous. 

15 Plants engineered for SCR suppression may exhibit improved 
vigorous growth characteristics when cultivated under 
conditions where thinner roots are advantageous. 

Engineered plants and plant lines possessing such 
improved agronomic characteristics may be identified by 

20 examining any of following parameters: 1) the rate of growth, 
measured in terms of rate of increase in fresh or dry weight; 
2) vegetative yield of the mature plant, in terms of fresh or 
dry weight; 3) the seed or fruit yield; 4) the seed or fruit 
weight; 5) the total nitrogen content of the plant; 6) the 

25 total nitrogen content of the fruit or seed; 7) the free 
amino acid content of the plant; 8) the free amino acid 
content of the fruit or seed; 9) the total protein content of 
the plant; and 10) the total protein content of the fruit or 
seed. The procedures and methods for examining these 

30 parameters are well known to those skilled in the art. 

According to the present invention, a desired plant 
is one that exhibits improvement over the control plant 
(i.e., progenitor plant) in one or more of the aforementioned 
parameters. In an embodiment, a desired plant is one that 

one parameter. In a preferred embodiment, a desired plant is 
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in at least one parameter. Most preferred is a plant that 
shows at least 50% increase in at least one parameter. 

6. EXAMPLE 1: ARABIDOPSIS SCR GENE 
5 This example describes the cloning and structure of 

the Arabidopsis SCR gene and its expression. The deduced 
amino acid sequence of the Arabidopsis SCR gene product 
contains a number of potential functional domains similar to 
those found in transcription factors. Closely related 

10 sequences have been found in both dicots and monocots 

indicating that Arabidopsis SCR is a member of a new protein 
family. The expression pattern of the SCR gene was 
characterized by means of in situ hybridization and by an .. 
enhancer trap insertion upstream of the SCR gene (described 

15 in more detail in Section 7) . The expression pattern is 

consistent with a key role for Arabidopsis SCR in regulating 
the asymmetric division of the cortex/endodermis initial 
which is essential for generating the radial organization of 
the root . 

20 

6.1. MATERIALS AND METHODS 
6.1.1. PLANT CULTURE 

Arabidopsis ecotypes Wassilewski j a (Ws) , Columbia 
(Col), and Landsberg erecta (Ler) were obtained from Lehle. 
25 Arabidopsis seeds were surface sterilized and grown as 
described previously (Benfey et al . , 1993, Development 
119:57-70). Generation of the enhancer trap lines is 
described in Section 7. 

30 6.1.2. GENETIC ANALYSIS 

For the scr-1 allele, co-segregation of the mutant 
phenotype and kanamycin resistance conferred by the inserted 
T-DNA was determined as described previously (Aeschbacher et 
•al . , 1995, Genes & Development 9:330-340). Because kanamycin 

35 affects root growth, 1557 seeds from heterozygous lines were 
germinated on non- selective media, scored for the appearance 
of the mutant phenotype, and subsequently transferred to 
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selective media. All (284) phenotypically mutant seedlings 
showed resistance to the antibiotic, whereas 834 of 1273 
phenotypically wild-type seedlings showed resistance to 
kanamycin, respectively. Phenotypically wild type plants 
5 (83) were also transferred to soil and allowed to set seeds. 
The progeny of these plants were plated on selective and non- 
selective media, and scored for the co-segregation of the 
mutant phenotype and antibiotic resistance. A majority (4 8) 
of the plants segregated for the mutant phenotype and for 

10 kanamycin resistance, whereas 35 were wild- type and sensitive 
to kanamycin. Due to a mis-identified cross, scr-2 was 
originally thought to be non-allelic and was named pinocchio 
(Scheres et al . , 1995, Development 121:53-62). Subsequent- 
mapping results placed it in an identical chromosomal 

15 location as scr-1. The original scr-2 line contained at 

least two T-DNA inserts. Co- segregation analysis revealed a 
lack of linkage between the antibiotic resistance marker 
carried by the T-DNA and the mutant phenotype. Antibiotic 
sensitive lines were identified that segregated for mutants. 

20 These lines were crossed to scr-1. All Fl antibiotic 
resistant progeny exhibited a mutant phenotype. All F2 
progeny (from independent lines) were mutant, and there was a 
3:1 segregation for antibiotic resistance indicating that the 
two mutations were allelic. Antibiotic sensitive lines of 

25 scr-2 were found to contain a rearranged T-DNA insert as 
determined by Southern blots and PCR using T-DNA specific 
probes and primers, respectively. The presence of this T-DNA 
in the SCR gene was confirmed by Southern blots using SCR 
probes. A combination of T-DNA and SCR specific primers was 

30 used to amplify T-DNA/ SCR junctions. The PCR fragments were 
cloned using the TA cloning kit (Invitrogen) and sequenced. 
The insertion points were determined for both 5' and 3* T- 
DNA/SCJ? junctions. 
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6.1.3. MAPPING 

Mutant plants of scr-2 (WS background) were crossed 
to Col WT. DNA from mutant F2 individual plants were 
analyzed for co- segregation with microsatellite (Bell & 
5 Ecker, 1994, Genomics 18:137-144) and CAPS markers (Konieczny 
& Ausubel, 1993, Plant J. 4:403-410). The closest linkage 
was found to two CAPS markers located at the bottom of 
chromosome III. Only one out of 238 mutant chromosomes was 
recombinant for the BGL1 marker (Konieczny & Ausubel, 1993, 

10 Plant J. 4:403-410) and one out of 210 chromosomes was 
recombinant for the cdc2b marker. 

A RFLP for the SCR gene was identified between Col 
and Ler ecotypes with Xho I endonuclease . Genomic DNAs from 
independent Rl lines (Jarvis et al . , 1994, Plant Mol . Biol. 

15 24:685-687) were digested with Xho I and blots were 

hybridized to SCR. Using the segregation data obtained for 
25 Rl lines, the SCR gene was mapped relative to molecular 
markers by CLUSTER. The SCR gene was assigned to the bottom 
of chromosome III closest to BGL1 . 

20 

6.1.4. PHENOTYPIC ANALYSIS 

Morphological characterization of the mutant roots 
was performed as follows: 7 to 14 days post -germination, 
phenotypically mutant seedlings were fixed in 4.0% 

25 formaldehyde in PIPES buffer pH 7.2. After fixation, the 
samples were dehydrated in ethanol followed by infiltration 
with Historesin (Jung-Leica, Heidelberg, Germany) . Plastic 
sections were mounted on superfrost slides (Fisher) . The 
sections were either stained with 0.05% toluidine blue and 

30 photographed using Kodak 160T film, or used for Casparian 
strip detection or antibody staining. 

Casparian strip detection was performed as 
described previously (Scheres et al . , 1995, Development 
121:53-62), with the following modifications. Plastic 

35 sections were used and the counterstaining was done in 0.1% 
aniline blue for 5 to 15 min . The sections were visualized 
with a Leitz fluorescent microscope with a FITC filter. 
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Pictures were taken using a Leitz camera attached to the 
microscope and Kodak HC400 film. Slides were digitized with 
a Nikon slide scanner and manipulated in Adobe Photoshop. 

For antibody staining, sections were blocked for 2 
5 hours at room temperature in 1% BSA in PBS containing 0.1% 
Tween 2 0 (PBT) . Samples were incubated with primary 
antibodies at 4° C in 1% BSA in PBT overnight, and then 
washed 3 times 5 minutes each with PBT. Samples were 
incubated for two hours with biotinylated secondary 

10 antibodies (Vector Laboratories) in PBT, and washed as above. 
Samples were incubated with Texas Red conjugated avidin D for 
2 hours at room temperature, washed as before, and mounted in 
Citifluor. Immunofluorescence was observed with a 
fluorescent microscope equipped with a Rhodamine filter. 

15 Staining with the CCRC antibodies was performed as described 
previously (Freshour et al . , 1996, Plant Physiol. 110:1413- 
1429) . 

6.1.5. MOLECULAR TECHNIQUES 

20 Genomic DNA preparation was performed using the 

Elu-Quik kit (Schleicher & Schuell) protocol. Radioactive 
and non-radioactive DNA probes were labeled with either 
random primed labeling or PCR-mediated synthesis according to 
the Genius kit manual (Boehringer Mannheim) . E. coli and 

25 AgroJbacterium tumefaciens cells were transformed using a BIO- 
RAD gene pulser. Plasmid DNA was purified using the alkaline 
lysis method (Maniatis et al . , Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor, New York: Cold Spring 
Harbor Laboratory, 1982) . 

3 0 A probe made from a rescued fragment of 1.2 kb was 

used to screen a wild-type genomic library made from WS 
plants. One genomic clone containing an insert of 
approximately 23 kb was isolated. A 3.0 kb Sac I fragment 
•from the genomic clone, which hybridized to the 1 . 2 kb probe, 

35 was subcloned and sequenced (FIG. 5A) . Comparison of the 

nucleotide sequence between the genomic clone and the rescued 
plasmid revealed the site of the T-DNA insertion. 
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Approximately 600,000 plaques from a cDNA library, obtained 
from inflorescences and siliques (Col ecotype) , and therefore 
enriched in embryos, were screened with the 1.2 kb probe. 
Four cDNA clones were isolated. The dideoxy sequencing 
5 method was performed using the Sequenase kit (United States 
Biochemical Corp.). Sequence-specific internal primers were 
synthesized and used to sequence the Sac I genomic as well 
the cDNA clones. Total RNA from plant tissues was obtained 
using phenol/chloroform extractions as described in Berry et 
10 al., 1985, Mol. Cell. Biol. 5:2238-2246 with minor 

modifications. Northern hybridization and detection were 
performed according to the Genius kit manual (Boehringer 
Mannheim) . 

To identify the site of insertion of the enhancer- 
15 trap T-DNA, genomic DNA from ET199 homozygous plants was 
amplified using primers specific for the T-DNA left border 
and the SCR gene. An approximately 2 . 0 kb fragment was 
amplified. This fragment was sequenced and the site of 
insertion was found to be approximately 1 kb from the ATG 
20 start codon. 

6.1.6. IN SITU HYBRIDIZATION 

Antisense and sense SCR riboprobes were labeled 
with digoxigenin-ll-UTP (Boehringer Mannheim) using T7 
25 polymerase following the manufacturer's protocol. Probes 
contained a 1.1 kb 3' portion of the cDNA. Probe 
purification, hydrolysis and quantification were performed as 
described in the Boehringer Mannheim Genius System user's 
guide . 

30 Tissue samples were fixed in 4 % formaldehyde 

overnight at 4°C and rinsed two times in PBS (Jackson et al . , 
1991, PI. Cell 3:115-125). They were subsequently pre- 
embedded in 1 % agarose in PBS. The fixed tissue was 
•dehydrated in ethanol, cleared in Hemo-De (Fisher Scientific, 

35 Pittsburgh, PA) and embedded in ParaplastPlus (Fisher 

Scientific) . Tissue sections (10/^m thick) were mounted on 
Superf rostPlus slides (Fisher Scientific) . Section 



- 70 - 



WO 00/53723 



PCTAJSOO/05875 



pretreatment and hybridization were performed according to 
Lincoln et al . , 1994, Plant Cell 6:1859-1876 except that 
proteinase K was used at 3 0 mg/ml and a two hour 
prehybridization step was included. A probe concentration of 
5 50 ng/ml/kb was used in the hybridization. 

Slides were washed and the immunological detection 
was performed according to Coen et al . , 1990, Cell 63:1311- 
1322 with the following modifications. Slides were first 
washed 5 hours in 5xSSC, 50% f ormamide . After RNase 

10 treatment, slides were rinsed three times (20 min each) in 
buffer (0.5 M NaCl, 10 mM Tris-HCl pH 8.0, 5.0 mM EDTA) . In 
the immunological detection, antibody was diluted 1:1000, 
levamisole (240 ng/ml) was included in the detection buffer, 
and after stopping the reaction in 10 mM Tris, 1 mM EDTA, 

15 sections were mounted directly to Aqua -Poly /Mount 
(Polysciences, Warrington, PA) . 

6.2. RESULTS 

6.2.1. CHARACTERIZATION OF THE SCR PHENOTYPE 

20 The scarecrow mutant scr-1 was isolated in a screen 

of T-DNA transformed Arabidopsis lines (Feldmann, K.A. , 1991, 
Plant J. 1:71-82), as a seedling with greatly reduced root 
length compared to wild-type (Scheres et al . , 1995, 
Development 121:53-62) . A second mutant scr-2 with a similar 

25 phenotype was subsequently identified among T-DNA transformed 
lines. Analysis of co-segregation between the mutant 
phenotype and antibiotic resistance carried by the T-DNA 
indicated tight linkage for scr-1 and no linkage for scr-2 
(see Experimental Procedures) . An antibiotic sensitive line 

30 of scr-2 was isolated and crossed with scr-1. The F2 progeny 
of this cross were all mutant and segregated 3:1 for 
antibiotic resistance confirming allelism (see Materials & 
Methods) . The principal phenotypic difference between the 
two alleles was that scr-1 root growth was more retarded than 

35 that of scr-2, suggesting that it is the stronger allele 
(FIG. 2A) . For both alleles, the aerial organs appeared 
similar to wild- type and the flowers were fertile (FIGS. 2A 
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and 2B) . The progeny of backcrosses of scr-1 or scr-2 to 
wild- type plants segregated 3:1 for the root phenotype for 
both alleles, indicating that each mutation is monogenic and 
recessive . 

5 Analysis of transverse sections through the primary 

root of seedlings revealed only a single cell layer between 
the epidermis and the pericycle (FIG. 2C) instead of the 
normal radial organization consisting of cortex and 
endodermis (FIG. 2D) . This radial organization defect was 

10 not limited to the primary root, but also was present in 

secondary roots (FIG. 2E) and in roots regenerated from calli 
(FIG. 2F) . Occasionally, defects were observed in the number 
of cells in the remaining cell layer (more than the invariant 
eight (8) found in wild-type) . Abnormal placement or numbers 

15 of epidermal cells also were observed (see FIG. 2E) . These 
abnormalities were more frequently observed in scr-1 than in 
scr-2. Nevertheless, organization of the mutant root closely 
resembles that of wild-type except for the consistent 
reduction in the number of cell layers. Because the 

20 endodermis and cortex are normally generated by an asymmetric 
division of the cortex/ endodermal initial, this indicates 
that the primary defect in scr is disruption of this 
asymmetric division. 

It has been shown that the radial organization 

25 defect in scr-1 first appears in the developing embryo at the 
early torpedo stage and manifests itself as a failure of the 
embryonic ground tissue to undergo the asymmetric division 
into cortex and endodermis (Scheres et al . , 1995, Development 
121:53-62). This defect extends the length of the embryonic 

30 axis which encompasses the embryonic root and hypocotyl . 

Other embryonic tissues appear similar to wild-type (Scheres 
et al., 1995, Development 121:53-62). In seedling hypocotyls 
of the scarecrow phenotype, two cell layers instead of the 
•normal three layers (two cortex and one endodermis) between 

35 epidermis and stele were found. This would be the expected 
result of the lack of the division of the embryonic ground 
tissue. Similar results were obtained for scr-2. Hence, 
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this mutant identifies a gene involved in the asymmetric 
division that produces cortex and endodermis from ground 
tissue in the embryonic root and hypocotyl and from the 
cortex/endodermal initials in primary and secondary roots. 

5 

6.2.2. CHARACTERIZATION OF CELL IDENTITY IN SCR 
ROOTS 

To understand the role of the Arabidopsis SCR gene 

in regulating this asymmetric division, it was necessary to 

1Q determine the identity of the mutant cell layer. Tissue - 
specific markers were used to distinguish between several 
possibilities. The cell layer could have differentiated 
attributes of either cortex or endodermis. Alternatively, it 
could have an undifferentiated, initial-cell identity or it 

15 could have a chimeric identity with differentiated attributes 
of both endodermis and cortex in the same cell. 

Transverse sections of scr-2 and scr-2 roots were 
assayed for the presence of tissue- specif ic markers. The 
casparian strip, a deposition of suberin between radial cell 

20 walls, is specific to endodermal cells and is believed to act 
as a barrier to the entry of solutes into the vasculature 
(Esau, K. Anatomy of Seed Plants, Mew York: John Wiley & 
Sons, 1977, Ed. 2, pp. 1-550) . Histochemical staining 
revealed the presence of a casparian strip in the mutant cell 

25 layer (FIG. 3A, compare to wild-type, FIG. 3B) . It is noted 
that in the vascular cylinder, this histochemical stain also 
reveals the presence of lignin, indicating the presence of 
differentiated xylem cells in mutant (FIG. 3A) and wild-type 
(FIG. 3B) . Another marker of the differentiated endodermis 

2Q is the arabinogalactan epitope recognized by the monoclonal 
antibody, JIM13 (Knox et al . , 1990, Planta 181:512-521). The 
mutant cell layer showed staining with this antibody 
(FIG. 3C, compare with wild-type, FIG. 3B) . As a positive 
control, the JIM7 antibody that recognizes pectin epitopes in 

35 all cell walls was used (FIGS. 3E and 3F) . These results 
indicate that the cell layer between the epidermis and the 
pericycle has differentiated attributes of the endodermis. 
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As a marker for the cortex, the CCRC-M2 monoclonal 
antibody was used. This antibody recognizes a cell wall 
oligosaccharide epitope, found only on differentiated cortex 
and epidermis cells. In sections from the differentiation 
5 zone of scr-1 and scr-2, both cortex and epidermal cells 

showed staining (PIG. 4A and 4B) that was similar to that of 
wild-type (FIG. 4C) . In scr-2, staining of both cell types 
was apparent, but staining of cortex was somewhat weaker than 
wild-type. The positive control used the CCRC-M1 monoclonal 

10 antibody which recognizes an oligosaccharide epitope found on 
all cells (FIGS. 4D-F) . 

With the CCRC-M2 antibody, an interesting 
difference was observed between the staining pattern of the 
mutants as compared to wild- type. The appearance of this 

15 epitope correlates with differentiation in these two cell 

types. Normally, in sections close to the root tip, there is 
no staining. In sections higher up in the root, 
atrichoblasts (epidermal cells that do not make root hairs) 
stain. In sections from more mature root tissue, all 

20 epidermal cells as well as cortex cells stain for this 

epitope. In both scr-1 and scr-2, sections could be found in 
which all epidermal cells stained while there was little 
detectable staining of cortex cells. Although not precisely 
identical to the wild-type staining pattern, the fact that 

25 the mutant cell layer clearly stains for this cortex marker 
indicates that there are cortex differentiated attributes 
expressed in these cells. 

Taken together, these results indicate that the 
mutant cell layer has differentiated attributes of both the 

30 endodermis and cortex. The possibility that there has been a 
simple deletion of a cell type, or that the resulting cell 
type remains in an undifferentiated initial-like stage can be 
ruled out . This result is consistent with a role for the SCR 
gene in regulating this asymmetric division rather than a 

35 role in directing cell specification. 
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6.2.3. MOLECULAR CLONING OF THE SCR GENE 
To further elucidate the function of the 
Arabidopsis SCR gene, the inserted T-DNA sequences were used 
to clone the gene. Plant DNA flanking the insertion site was 
5 obtained from scr-1 by plasmid rescue and used to isolate the 
corresponding wild- type genomic DNA. Several cDNA clones 
were isolated from a library made from silique tissue. 
Comparison of the sequence of the longest cDNA and the 
corresponding genomic region revealed an open reading frame 

10 (ORF) interrupted by a single small intron. (FIG. 5A) . A 
potential TATA box and polyadenylation signal that matched 
the consensus sequences for plant genes were also identified 
(Joshi, CP., 1987, Nucl . Acids Res. 15:6643-6653); Heidecker 
& Messing, 1986, Ann. Rev. Plant Physiol. 37:439-466); Mogen 

15 et al., 1990, Plant Cell 2:1261-1272). 

Comparison of the nucleotide sequence between the 
genomic clone and the rescued plasmid placed the site of the 
T-DNA insertion in scr-1 at codon 470 (FIGS. 5A and 5B) . For 
scr-2, although no linkage was found between the mutant 

20 phenotype and antibiotic resistance, DNA blot and PCR 

analysis of antibiotic sensitive lines revealed the presence 
of T-DNA sequences that co-segregated with the mutant 
phenotype. The insertion position in scr-2 was determined by 
cloning and sequencing the PCR products amplified from its 

25 genomic DNA using a combination of T-DNA and SCR specific 
primers at both sides of the insertion (FIG. 5B) . In scr-2, 
the T-DNA insertion point is at codon 605 (FIG. 5A and 5B) . 

To verify linkage between the cloned gene and the 
mutant phenotype, we identified the chromosomal location of 

3 0 both the scr locus and the SCR gene. To map the scr locus, 
molecular markers were used on F2 progeny of crosses between 
scr-2 (ecotype Wassilewskija, Ws) and Colombia (Col) WT. 
These placed the scr locus at the bottom of chromosome III, 
'approximately 0.5 cM away from each of the two closest 

35 markers available, cdc2b and BGL1 (Konieczny and Ausubel, 
1993, Plant J. 4:403-410). To map the SCR gene, we 
identified a polymorphism between Col and Landsberg (Ler) 
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ecotypes using the SCR probe b (FIG. 5B) . Southern analysis 
of 25 recombinant inbred lines (Jarvis et al . , 1994, Plant 
Mol . Biol. 24:685-687) mapped the cloned gene to the same 
location as the SCR locus on chromosome III. 
5 The determination of the molecular defects in two 

independent alleles and the co-localization of the cloned 
gene and the mutant locus confirms that we have identified 
the SCR gene . 

10 6.2.4. THE SCR GENE HAS MOTIFS THAT INDICATE IT 
IS A TRANSCRIPTION FACTOR 

The Arabidopsis SCR gene product is a 653 amino 
acid polypeptide that contains several domains (FIG. 5B) . - 
The amino-terminus has homopolymeric stretches of glutamine, 

15 serine, threonine and proline residues, which account for 44% 
of the first 267 residues. Domains rich in these residues 
have been shown to activate transcription and may serve such 
a role in SCR (Johnson et al . , 1993, J. Nutr. Biochem 4:386- 
398) . A charged region between residues 265 and 283 has 

20 similarity to the basic domain of the bZIP family of 

transcriptional regulatory proteins (FIG. 5C) (Hurst, H.C , 
1994, Protein Profile 1:123-168). The basic domains from 
several bZIP proteins have been shown to act as nuclear 
localization signals (Varagona et al . , 1992, Plant Cell 

25 4:1213-1227), and this region in SCR may act similarly. This 
charged region is followed by a leucine heptad repeat 
(residues 291-322) . A second leucine heptad repeat is found 
toward the carboxy- terminus (residues 43 6 to 4 73) . As 
leucine heptad repeats have been demonstrated to mediate 

30 protein-protein interactions in other proteins (Hurst, H.C, 
1994, Protein Profile 1:123-168), the existence of these 
motifs suggests that SCR may function as a dimer or a 
multimer. The second leucine heptad repeat is followed by a 
•small region rich in acidic residues, also present in a 

35 number of defined transcriptional activation domains (Johnson 
et al., 1993, J. Nutr Biochem 4:386-398). While each of 
these domains has been found within proteins that do not act 
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as transcriptional regulators, the fact that all of them are 
found within the deduced SCR protein sequence indicates that 
SCR is a transcriptional regulatory protein. 

5 6.2.5. SCR IS A MEMBER OF A NOVEL PROTEIN FAMILY 

The Arabidopsis SCR protein sequence was compared 
with the sequences in the available databases. Eleven 
expressed sequence tags (ESTs) , nine from Arabidopsis, one 
from rice and one from maize, showed significant similarity 

10 to residues 394 to 435 of the SCR sequence, a region 
immediately amino-terminal to the second leucine heptad 
repeat (FIGS. 15K-L) . This region is designated the VHIID 
domain. Subsequent analysis of these EST sequences has 
revealed that the sequence similarity extends beyond this 

15 region; in fact, the similarity extends throughout the entire 
known gene products. The combination and order of the motifs 
found in these sequences do not show significant similarity 
to the general structures of other established regulatory 
protein families (i.e., bZIP, zinc finger, MADS-domain and 

20 homeodomain) , indicating that the SCR proteins comprise a 
novel family. 

6.2.6. SCR IS EXPRESSED IN THE CORTEX/ENDODERMAL 
INITIALS AND IN THE ENDODERMIS 

25 RNA blot analysis revealed expression of SCR in 

Arabidopsis siliques, leaves and roots of wild- type plants 
(FIG. 6A) . No hybridization was detected to RNA from scr-1 
plants (FIG. 6B, lane 2) . This indicates that scr-1 has a 
reduced level of RNA expression and may represent the null 

30 phenotype. Hybridization to RNA species larger than the 
normal size were detected in scr-2. This indicates that 
abnormal SCR transcripts are made in this allele, suggesting 
that functional but possibly altered proteins may be 
■produced . 

35 To determine if expression was localized to any 

particular cell type, RNA in situ hybridization was performed 
on sections of root tissue. In mature roots, expression was 



- 77 - 



WO 00/53723 



localized primarily to the endodermis (FIGS. 7A and 7B) . 
Expression appeared to start very close to, or within, the 
cortex/endodermal initials and continue up the endodermal 
cell file as far as the section extended. Expression was 
5 detected also in late-torpedo stage embryos in the endodermis 
throughout the embryonic axis (FIG. 7C) . Sense strand 
controls showed only background hybridization (FIG. 7D) . 

To determine whether the localization of SCR RNA 
was regulated at the transcriptional or post-transcriptional 

10 level, enhancer trap (ET) lines were prepared and examined in 
which the ^-glucuronidase (uid-A or GUS) coding sequence with 
a minimal promoter was expressed in the root endodermis. 
(See Section 7, infra) . Restriction fragment length 
polymorphisms were observed when DNA from one of these lines, 

15 ET199 and wild-type were probed with SCR. PCR and sequence 
analysis confirmed that the enhancer-trap construct had 
inserted approximately 1 kb upstream o£ the SCR start site 
and in the same orientation as that of SCR transcription. 

In mature roots, expression in ET199 whole mounts 

20 showed a similar pattern to that of the in situ 

hybridizations, with the strongest staining present in 
endodermal cells (FIG. 7E) . Transverse sections indicated 
that expression was primarily in endodermal cells in the 
elongation zone (FIG. 7F) . Longitudinal sections through the 

25 meristematic zone revealed that expression could be detected 
in the cortex/endodermal initial (FIG. 7G) . Of particular 
interest was the restriction of expression to the endodermal 
daughter cell after the periclinal division (FIG. 7G) . This 
indicated that the expression pattern observed in the in situ 

30 analysis was not due to post-transcriptional partitioning of 
SCR RNA. Rather, it suggests that after the periclinal 
division of the cortex/endodermis initial, only one of the 
two cells is able to transcribe SCR RNA. 
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6.3 . DISCUSSION 

6.3.1. THE SCR GENE REGULATES AN ASYMMETRIC 
DIVISION REQUIRED FOR ROOT RADIAL 
ORGANIZATION 

5 The formation of the cortex and endodermal layers 

in the Arabidopsis root requires two asymmetric divisions. 
In the first, an anticlinal division of the cortex/endodermal 
initial generates two cells with different developmental 
potentials. One will continue to function as an initial, 
while the other undergoes a periclinal division to generate 
the first cells in the endodermal and cortex cell files. 
This second asymmetric division is eliminated in the 
scarecrow mutant, resulting in a single cell layer instead of 
two. The scr mutation appears to have little effect on any 

15 other cell divisions in the root indicating that it is 

involved in regulating a single asymmetric division in this 
organ. Several other mutations have been characterized that 
appear to affect specific cell division pathways in 
Arabidopsis. These include knolle (kn) , in which formation 

20 of the epidermis is impaired (Lukowitz et al . , 1996, Cell 

84:61-71); wooden leg (wol) , in which vascular cell division 
is defective (Scheres et al . , 1995, Development 121:53-62) 
and fass (fs) , in which there are supernumerary cortex and 
vascular cells (Scheres et al . , 1995, Development 121:53-62); 

25 Torres Ruiz & Jurgens, 1994, Development 120:2967-2978). 

Only in the case of scr and short-root (shr) mutants has it 
been shown that the defect is in a specific asymmetric 
division . 

Mutational analyses in several organisms have 
3 q revealed that the genes that regulate asymmetric divisions 
can be specific to a single type of division or can affect 
divisions that are not clonally related (Horvitz & 
Herskowitz, 1992, Cell 68:237-255). In most cases, these 
mutations result in the formation of two identical daughter 
35 cells with similar developmental potentials (Horvitz & 
Herskowitz, 1992, Cell 68:237-255). Both resulting cells 
have the identity of one or the other of the normal daughter 
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cells, an example of which is the swi mutation in 
S. cerevisiae (Nasmyth et al . , 1987, Cell 48:579-587). 
However, there are also examples of mutations that result in 
the formation of chimeric cell types such as the ham-1 
5 mutation in C. elegans (Desai et al . , 1988, Nature 336:638- 
646) . 

6.3.2. SCR INVOLVEMENT IN CELL 

SPECIFICATION OR CELL DIVISION 

10 Genes that regulate asymmetric cell divisions can 

be divided into those that specify the differentiated fates 
of the daughter cells and those that function to effect the 
division of the mother cell (Horvitz & Herskowitz, 1992, 
Cell, 68:237-255). The aberrant cell layer formed in the scr 

15 mutant has differentiated features of both endodermal and 
cortex cells. Thus, scr is in the rare class of asymmetric 
division mutants in which a chimeric cell type is created. 
The ability to express differentiated characteristics of 
cortex and endodermal cells implies that the differentiation 

2 0 pathways for both of these cell types are intact and do not 
require the functional SCR gene. This indicates that SCR is 
involved primarily in regulating a specific cell division, 
and that the correct occurrence of this division can be 
unlinked from cell specification. This is in contrast to the 

25 shr mutant, in which the periclinal division of the 
cortex/endodermal initial also fails to occur and the 
resulting cell lacks endodermal markers (Benfey et al . , 1993, 
Development 119:57-70) and has cortex attributes. A genetic 
analysis was used to address the function of SHR and SCR in 

30 the asymmetric division of the cortex/endodermal initial. 
Placing mutants of each of these genes in a fs mutant 
background answered whether the supernumerary cell divisions 
characteristic of fs were sufficient to restore normal cell 
identities (Scheres et al . , 1995, Development 121:53-62). In 

35 the shr, f s double mutant, there were additional cell layers 
but no endodermal, indicating that the SHR gene has a role in 
specifying cell identity. In the scr,fs double mutant, no 
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alteration in cell identity was observed as compared to fs 
(Scheres et al . , 1995, Development 121:53-62). Taken 
together with the cell marker analysis presented herein, 
these results are consistent with a role for SCR in 
5 generating the division of the mother cell while the SHR gene 
may be involved in specifying the fate of the endodermal 
daughter . 

6.3.3. A ROLE FOR SCR IN EMBRYONIC DEVELOPMENT 

10 At least one additional cell division appears to be 

affected in the scr mutant. During embryonic development, 
the ground tissue does not divide to form the endodermal and 
cortex layers of the embryonic root and hypocotyl . As shown 
herein, expression of SCR was detected in the endodermal 

15 tissue throughout the embryonic axis shortly after this 
division occurs. Thus, SCR may play a direct role in 
regulating both this division and the division of the 
cortex/endodermal initial in the root apical meristem. 
Alternatively, the radial organization established in the / 

20 embryo may somehow act as a template that directs the 

division of the cortex/endodermal initial, thus perpetuating 
the pattern. This is consistent with the finding in the scr 
mutant that the aberrant pattern established in the embryo is 
perpetuated in the primary root. It also is consistent with 

25 a recent study in which the daughter cells of the 

cortex/endodermal initial were laser ablated (van den Berg et 
al., 1995, Nature 378:62-65). When a single daughter cell 
was ablated, it was replaced by a cell that followed the 
normal asymmetric division pattern. When three adjacent 

30 daughter cells were ablated, the central initial divided 
anticlinally but failed to perform the periclinal division 
(van den Berg et al . , 1995, Nature 378:62-65). This provided 
evidence that information from mature cells is required for 
the correct division pattern of cortex/endodermal initials 

3 5 suggesting a "top down" transfer of information. However, 
the absence of a cell layer in lateral roots and callus- 
derived roots of the scr mutant suggests that embryo events 
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are not unique in their ability to establish radial 
organization. Rather, these observations implicate SCR in 
regulating both embryonic and post -embryonic root radial 
organization. 

5 

6.3.4. TISSUE -SPECIFIC EXPRESSION OF SCR IS 

REGULATED AT THE TRANSCRIPTIONAL LEVEL 

Although not intending to be limited to any theory 
or explanation regarding the mechanism of SCR action, the 

10 cloning of the gene and the expression pattern provide some 
clues as to the role of SCR in the regulation of a specific 
asymmetric division. The SCR gene is expressed in the 
cortex/endodermal initial, but immediately after division -is 
restricted to the endodermal lineage. A similar pattern is 

15 seen in the ET199 enhancer trap line in which SCR regulatory 
elements are in proximity to a GUS gene, indicating that SCR 
restriction to the endodermal cell file is due to 
differential regulation of expression of the SCR gene in this 
cell and the first cell in the cortex file. Another marker 

20 line in which expression of GUS is detected only in the 
cortex daughter cell provides a control for differential 
degradation of GUS RNA or protein. Thus, partitioning of SCR 
RNA as a means of achieving this segregation of expression 
can be ruled out. What remains to be determined is whether 

25 this difference in transcriptional activity of the two 

daughter cells is due to internal polarity of the mother cell 
prior to division such that cytoplasmic determinants are 
unequally distributed, or to external polarity that 
influences cell fate after division. Since SCR is expressed 

30 prior to cell division, an attractive hypothesis is that it 
is involved in establishing polarity in the cortex/endodermal 
initial . The sequence of the SCR protein strongly suggests 
that it acts as a transcription factor. Hence, it may act to 
regulate the expression of other genes essential for the 

35 establishment of unequal division. Alternatively, it is 

conceivable that it could play a role in creating an external 
polarity that provides a signal to divide asymmetrically. 
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Its expression in more mature endodermal cells is consistent 
with a role in "top-down" signaling. 

6.3.5. A NEW FAMILY OF TRANSCRIPTIONAL REGULATORS 
5 Analysis of at least eighteen EST clones found in 

the GenBank database reveals that the proteins they encode 
share a high degree of homology with Arabidopsis SCR protein. 
See Tables 1 and 2 and FIGS. 15A-S and 2 8A-AH. Further 
sequence analysis of the encoded proteins indicate that a 

10 high degree of sequence similarity extends from at least the 
highly conserved VHIID domain to the carboxy- terminus of the 
gene products. Comparison of the amino termini of these 
proteins is precluded by the fact that the ESTs are 
incomplete. The high degree of similarity among these 

15 proteins, in combination with the motifs observed in the SCR 
protein (homopolymeric motifs, two leucine heptad repeats and 
a bZIP-like basic domain that may also function as a nuclear 
localization sequence) indicates that these proteins form a 
novel class of regulatory proteins. 

20 The insertion sites of the T-DNA in the two scr 

mutant alleles raised the possibility that the mutant 
phenotype was due to the production of truncated proteins . 
Northern blot analysis indicated SCR RNA is undetectable in 
scr-1. This suggests that the phenotype is either the null, 

25 or due to highly reduced RNA expression. In scr-2 , an 

alteration in RNA size was detected which would be consistent 
with the presence of a functional and possibly truncated 
protein. This could provide an explanation for the 
observation that scr-2 appears to be the weaker allele. 

30 

7. EXAMPLE 2 : ENHANCER TRAP ANALYSIS OF ROOT DEVELOPMENT 
An enhancer trap system was used in order to 
provide a more detailed molecular analysis of gene expression 
in lateral root patterning and development in Arabidopsis 
35 thaliana. A new collection of marker lines that express (3- 
glucuronidase (GUS) activity in a cell -type specific manner 
in each of the cells of the root was generated. These lines 
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allow differentiation of cells to be monitored based on 
molecular characteristics. One of these marker lines, ET199, 
resulted from the integration of the GUS cassette in 
proximity to a SCR enhancer. The results described below 
5 demonstrate that transcriptional activation of the SCR gene 
plays an important role in root development in Arabidopsis, 
and that SCR gene transcriptional regulatory elements can 
express a transgene in a development ally and tissue specific 
manner. 

10 

7.1. MATERIALS AND METHODS 

7.1.1. PLANT GROWTH CONDITIONS: 

Arabidopsis seeds from NO-O and Columbia ecotypes 
were sterilized and sown on MS plates containing 4.5% 
15 sucrose. Plates were oriented vertically and maintained 
under an 18 hours light, 6 hours dark cycle. 

7.1.2. HISTOLOGY AND GUS STAINING: 

For observation of lateral roots, roots were 
20 removed from plates and infiltrated in 25% glycerol for 

several hours to overnight. Roots were then mounted in 50% 
glycerol. Whole seedlings were stained for GUS activity for 
up to three days in the following solution: IX GUS buffer, 
20% methanol, 0.5 mg/ml X-Glu. Addition of methanol greatly 
25 improves the specificity and reproducibility of staining. 

Staining solution was made fresh from a 10X buffer (1 M Tris 
pH7.5, 2 90 mg NaCl , 66 mg K 3 Fe(CN) 6 ) that was stored for no 
more than one week. Stained roots were cleared in glycerol 
and mounted as above. All samples were observed using 
30 Nomarski optics on a Leitz Laborlux S microscope. 

Photographs were taken using a Leitz MPS 5 2 camera, and images 
were scanned into Adobe Photoshop to create figures. In some 
cases the intensity of the blue color was increased. 
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7.1.3. CONSTRUCTION OF ENHANCER TRAP LINES: 
Plant Cloning Vector (PCV) (Koncz et al . , 1994, 
Specialized vectors for gene tagging and expression studies, 
in Plant Molecular Biology Manual . Gelvin & Schilperoort , 
5 eds., Vol. B2, pp. 1-2, Kluover Academic Press, Dordrecht, 
The Netherlands) contains a Bam HI site immediately adjacent 
to the T-DNA right border sequence. The 3 -glucuronidase gene 
fused to the TATA region (-4 6 to 78) of the CaMV 3 5S promoter 
was introduced into this site (Benfey et al . , 1990, EMBO J. 
10 9:1677-1684). 350 transgenic lines were generated by 

Agrobacterium mediated root transformation (Marton & Browse, 
1991, Plant Cell Reports 10:235-239), and 4 independent lines 
from each transformant were screened for GUS activity in the 
root . 

15 

7.2 . RESULTS 

7.2.1. DIFFERENTIATION IN THE LRP 

The marker lines described above reflect patterns 
of gene expression that are specific to individual root cell 

20 types. There are no readily apparent mutant phenotypes in 
any of these lines. Therefore, they can be used to analyze 
the differentiation state of the cells during normal 
development of the lateral root primordial (LRP) . If there 
are stages at which the pericycle cells proliferate in the 

25 absence of patterning, it can be expected that all cells 
would be identical with none expressing differentiated 
characteristics. In contrast, organization of the LRP would 
be reflected in differential patterns of GUS gene expression, 
with certain cells beginning to turn on transcription from 

30 differentiated cell -type specific promoters (i.e., those that 
drive GUS expression in the enhancer trap lines) . 

The process of lateral root formation is divided 
into the following seven stages: 
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Stage I: The LRP is first visible as a set of pericycle 
cells that are clearly shorter in length than their 
neighbors, having undergone a series of anticli 
Laskowski et al., 1995, Dev. 121:3303-3310 predict that there 
5 are approximately 4 founder pericycle cells involved. In the 
longitudinal plane, these divisions result in the formation 
of 8-10 small cells, which enlarge in a radial direction. 

Stage II: A periclinal division occurs that divides the LRP 
10 into two layers (Upper Layer (UL) and Lower Layer (LL) ) . Not 
all the small pericycle-derived cells appear to participate 
in this division -- typically the most peripheral cells do 
not divide. Hence, as the UL and LL cells expand radially,' 
the domed shape of the LRP begins to appear. 

15 

Stage III: The UL divides periclinally, generating a three 
layer primordium comprised of UL1, UL2 and LL. Again, some 
peripheral cells do not divide, creating peripheral regions 
that are one and two cell layers thick. This further 
20 emphasizes the domed shape of the LRP. 

Stage IV: The LL divides periclinally, creating a total of 
four cell layers (UL1, UL2 , LL1, LL2) . At this stage, the 
LRP has penetrated the parent endodermal layer. 

25 

divisions that push the overlying layers up and distort the 
cells in LL1. These divisions are difficult to visualize at 
this stage, but clearly form a knot of mitotic activity. The 
30 LRP at this stage is midway through the parent cortex. The 
outer layer contains 10-12 cells. 

Stage VI: This stage is characterized by several events. 
The four central cells of UL1 divide periclinally. This 
35 division is particularly useful in identifying the median 
longitudinal plane in the enlarging LRP. At this point, 
there are a total of twelve cells in UL1, four in the middle 
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that have undergone the periclinal division and four on 
either side. In addition, all but the most central cells of 
UL2 undergo a periclinal division. At this point the LRP has 
passed through the parent cortex layer and has penetrated the 
5 epidermis. The central cells apparently derived from LL2 
have a distinct elongated shape characteristic of vascular 



Stage VII: As the primordium enlarges, it becomes difficult 
0 to characterize the divisions in the internal layers. 
However, the cells in the outermost layer can still be seen 
very clearly. All of these cells undergo an anticlinal 
division, resulting in 16 central cells (8 cells in each of 
two layers) flanked by 8-10 cells on each side. We refer to 
5 this as the 8-8-8 cell pattern. The LRP appears to be just 
about to emerge from the parent root. 



An enhancer trapping cassette was generated by 

20 fusing the GUS coding sequence to the minimal promoter of the 
35S promoter from CaMV. This minimal promoter does not 
produce a detectable level of GUS expression. However, its 
presence allows other upstream elements to direct GUS 
expression in a developmental and/or cell-specific manner 

25 (Benfey et al., 1990, EMBO J. 9:1677-1684). The use of a 
minimal promoter instead of a promoterless construct allows 
GUS expression to occur even if the enhancer trap cassette 
inserts at a distance from the coding region. Since the 
insert does not have to be within the structural gene, there 

30 are often no mutations generated in the enhancer trap lines. 
The minimal promoter:GUS construct was cloned immediately 
adjacent to the T-DNA right border sequence of PCV (Koncz et 
al., supra) and introduced into Arabidopsis. 350 independent 
lines were generated and analyzed for GUS activity in the 

35 root. The following lines most clearly define each cell 
type. All of the lines were generated through e: 
trapping, as described herein, below, except for O 
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(Dietrich et al . , 1992, Plant Cell 4:1371-1382) and 
EpiGL2:GUS (Masucci et al . , Dev. 122:1253-1260) which are 
transgenic plants that contain cell-type specific promoters 
fused to the GUS gene. 

5 

Ste05 - expresses GUS in the stele including the pericycle 
layer throughout primary and lateral roots. At the root tip, 
staining becomes weaker in the elongation zone; therefore, it 
is likely that only differentiated stele cells express GUS 
10 activity. Stelar GUS expression is seen also in aerial parts 
of the plant . 

Endl95 - expresses GUS in the endodermis of primary and 
lateral roots. Staining can be seen most clearly in the 

15 cells in the meristematic region of the root, although 

over staining shows that more mature cells also express some 
GUS activity. It appears that there is no staining in the 
cortex/endodermal initial, but staining is evident in the 
first daughter cell of this initial. GUS expression is seen 

20 also at the base of young leaves and in- the stipules. 

ET199 - expresses GUS in the endodermis of primary and 
lateral roots, again most clearly in cells in the 
meristematic region. Unlike Endl95, staining in ET199 
25 appears to continue down to the cortex/endodermal initial 
and, in younger roots, even into the cells of the quiescent 
center. Expression in the aerial parts of the plant is 
detectable in the young leaf primordia. 

30 CorAX92 - This line was generated by fusing the 5' and 3' 
sequences from a cortex specific gene isolated from oilseed 
rape to the GUS reporter gene (Dietrich et al . , Plant Cell 
4:1371-1382). Expression is limited to the cortex layer, 
extending to, but not including, the cortex/endodermal 

35 initial. Staining is also apparent in the petioles and leaf 
blades of expanded leaves. 
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EpiGL2:GUS - This line was generated by fusing the GL2 
promoter to the GUS gene (Masucci et al . , Dev. 122:1253- 
1260) . Expression is seen in the non-hair forming epidermal 
cells (atrichoblasts) . Staining is seen near the root tip, 
5 but it is difficult to determine if it includes the epidermal 
initial. Staining is seen also in the trichomes, leaf 
primordia and the epidermis of the hypocotyl and leaf 
petioles . 

10 CRC219 - This line shows staining in the columella root cap 
only. 

LRC244 - This line shows staining in the lateral root cap 
only. 

15 

RC162 - This line shows staining in both the lateral and 
columella root caps. 

Two marker lines show differential staining at 
20 very early stages of LRP development. One of these, ET199, 
presents a complex and dynamic pattern of expression. 
Staining is first apparent at stage II in only the four 
central cells of the UL. At stage III, staining is strongest 
in the central cells of UL2 . As the LRP reaches stage V, the 
25 staining remains strongest in the central 2-4 cells of UL2 . 
By stage VI, staining also begins to extend into the newly 
formed endodermal layer, and staining in both the central 
cells and endodermis persists beyond emergence of the lateral 
root . 

30 Another line, LRB10 (lateral root base) , does not 

express GUS in the primary root tip. Staining in the LRP is 
seen at stage I, and at stage II all the cells of the UL and 
LL are stained. However, by stage IV and V only, the cells 
at the periphery of the LRP still are expressing GUS. As the 

35 LRP develops, these cells continue to stain, although less 
intensely, resulting in a ring of GUS expressing cells at the 
base of the LR. 
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LRB10 and ET199 clearly demonstrate non- identity 
between the cells at very early stages, stage IV in the case 
of LRB10 and within the UL at stage II in ET199. In 
addition, although it is difficult to identify the nature of 
5 the cells that correspond to the observed staining pattern in 
LRB10 and the early staining cells of ET199, post-emergent 
lateral roots show analogous staining in these lines, 
suggesting that the stained cells already are expressing 
markers that reflect their differentiated cell fates. Hence, 
10 these observations suggest a very early onset of 
differentiation in the cells of the LRP. 

7.2.3. ET199 PROVIDES EVIDENCE FOR THE ROLE OF 
SCR IN PLANT DEVELOPMENT 

15 Fortuitously, it was discovered that the GUS 

cassette in ET199 described Section 7.2.2, above, is situated 
approximately 1 kb upstream from the SCR gene. The SCR cDNA 
was labelled and used to probe genomic DNA from WT and ET199 
plants. The band pattern seen in the Southern was completely 

20 consistent with a T-DNA inserted 1 kb upstream of the 

putative SCARECROW start site. Subsequently, a DNA fragment 
was PCR amplified using a primer within the T-DNA and a 
primer within SCARECROW. The size of this fragment was 
consistent also with the predicted insertion site. Partial 

25 sequencing of the PCR fragment confirmed the presence of 

SCARECROW sequence. Mutants in the SCR gene are completely 
lacking one of the radial layers between the epidermis and 
pericycle in both primary and lateral roots, due to the 
absence of specific cell division during embryogenesis and of 

30 the cortex/endodermal initial during post -embryonic growth. 
The expression pattern (described in Section 7.2.2., above) 
that was observed in the central cells of the developing LRP 
of ET199 provides strong evidence that the cells in this 
•region are involved in the establishment of the meristematic 

35 initials. More importantly, these results demonstrate that 
transcriptional activation of the SCR gene plays a major role 
in the development of the Arabidopsis LRP. Furthermore, 
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these results demonstrate that a transgene can be expressed 
under the control of SCR gene transcriptional regulatory- 
elements in a developmental and tissue -specific manner. 

5 8- EXAMPLE 3: ACTIVITY OF ARABIDOPSIS SCR 
PROMOTER IN TRANSGENIC ROOTS 

The expression pattern of Arabidopsis SCR has 

been determined by analysis of an enhancer trap line, ET199, 

in which a GUS coding region with a minimal promoter was 

0 fortuitously inserted 1 kb upstream of the SCR coding region 

(see supra) . In ET199 plants, GUS expression is detected in 

the endodermis, endodermal initials and sometimes in the 

quiescent center (QC) of the root. See supra and Malamy and 

Benfey, 1997, Dev. 124:33-44. This expression pattern of SCR 

2 in the primary root has been confirmed by in situ analysis 

(See supra and Di Laurenzio et al . , 1996, Cell 86:423-433). 

The following experiments demonstrate that 2.5 kb 

of 5 ' sequence upstream of the Arabidopsis SCR coding region 

is sufficient to confer SCR expression pattern to a 

, 0 heterologous gene. The 5' sequence used in these studies 

starts from the Hind III site approximately 2.5 kb upstream 

of the ATG initiation site and extends 3 ' downstream to the 

base pair immediately upstream of the ATG initiation site 

(see FIG. 14) . This 5' sequence was fused to a GUS coding 

, 5 sequence. The resulting SCR promoter :: GUS construct was 

incorporated into an Agrrobacterium vector, which was used to 

transform and generate transgenic roots using standard 

procedures . 

A large number of roots were regenerated. They 
,q show GUS staining pattern that is similar to the SCR 

expression pattern in ET199 plants (Figure 19, Panel f) . 
Since organs regenerated from callus often have an abnormal 
morphology, transgenic roots were transferred to liquid 
.culture. Roots grown in liquid culture appeared 
,g morphologically normal and showed GUS expression in the 
endodermis, endodermal initial and QC (Figure 19, Panel g) , 
similar to the expression pattern of SCR seen in the 
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enhancer trap line ET199. These results indicate that the 
2.5 kb region upstream of the SCR start site is sufficient to 
confer the SCR expression pattern in the root . 

The expression of the SCR promoter: :GUS construct 
5 was examined also in the scr mutant background. The scr 
mutant has an altered root organization (see, supra) . 
Whereas the wild-type root of Arabidopsis has four distinct 
cell layers surrounding the vascular tissue, the roots of scr 
mutant have only three . 

10 Transgenic roots of the scr mutant that contained 

a SCR promoter: :GUS construct were generated. As in the 
wild-type, a large number of transgenic roots were formed 
that had detectable GUS expression (Figure 20, Panel a) . 
These roots were shorter than wild- type regenerated roots, 

15 consistent with the shorter root phenotype of the scr mutant. 
Additional transgenic root experiments 
demonstrated that the SCR gene under control of its own 
promoter can rescue the scr mutant phenotype . Transgenic scr 
roots were generated that contained the full length SCR gene 

20 under the control of its own promoter. The length of 

transgenic roots containing the construct were longer than 
those of the scr mutant, indicating that the introduced SCR 
gene partially rescued the mutant . Whereas scr regenerated 
roots that carried the SCR promoter: : GUS construct were very 

25 short (Figure 21, Panel a; and Figure 20, Panel a) , roots 
transformed with the SCR promoter and coding region were 
noticeably longer (Figure 21, Panel b) . The difference was 
even more obvious in liquid culture, in which scr mutant 
roots remained short (Figure 21, Panel c) , while SCR gene 

30 complemented scr mutant roots were long and resembled wild- 
type roots (Figure 21, Panel d) . 

Anatomical studies of the regenerated roots 
confirmed the ability of the SCR promoter: -.SCR gene construct 
to rescue the scr mutant phenotype. Whereas regenerated 

35 roots of scr mutants were missing an internal layer (Figure 
21, Panel e) , the scr mutant roots that were transformed with 
the SCR promoter : : SCR gene construct had a radial 
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organization that resembled wild-type root (Figure 21, 

9. EXAMPLE 4: ISOLATION OF SCR SEQUENCES USING 

5 PCR-CLONING STRATEGY 

Based on the comparison of the sequences of SCR 
paralogs in Arabidopsis, degenerate primers SCR3AII, SCR5AII 
and SCR5B were designed and used in PCR amplification of SCR 
sequences from genomic DNA of various plant species. The 

10 amplification was performed according to conditions described 
in Section 5.1.1., supra, using DNA isolated from maize 
plants grown from a commercial seed mixture. Amplification 
products (104 bp fragment for the SCR5B+SCR3AII primer 
combination; 146 bp fragment for the SCR5AI I + SCR3AI I primer 

15 combination) were obtained, and each cloned into a T/A vector 
(Invitrogen, San Diego, CA) and sequenced. Two of the three 
different types of clones obtained had deduced amino acid 
sequences that were very similar to a part of the Arabidopsis 
SCR protein (i.e., approximately 90% identity), suggesting 

20 that they represent parts from two different alleles of the 
maize SCR gene (i.e., ZCR gene) . The two clones each had 
only two conservative changes in their nucleotide sequence. 

The 14G bp amplification product, ZmScll, was 
subsequently used as a probe for screening of a genomic 

25 library generated in lambda BlueSTAR vector (NOVAGEN) from 
maize (Hill line) genomic DNA. The screening was performed 
according to the standard procedures described in Genius" 
System User's Guide For Membrane HvPriaization (Boehringer- 
Mannheim) : The probe was a single-strand DNA molecule 

30 corresponding to the ZmScll fragment produced by PCR (Genius, 
Boehringer-Mannheim) . Hybridization was performed according 
to recommendations of the manufacturer's manual 
(Boehringer-Mannheim) . Prehybridization was for 2 hr in 50% 
formamide hybridization solution at 42°C. Hybridization was 

J5 overnight at 42°C with 200 ng/ml probe concentration. 

Filters were washed twice at room temperature in 2x SSC, 0.1% 
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SDS for 5 min, and for stringent washing at 65°C in 0.5x SSC, 
0.1% SDS twice for 15 min. 

A positive clone was identified. The clone 
contained a 13 kb insert, which was subcloned into a plasmid 
5 vector. The resulting plasmid was designated pZCR. A 5 kb 
Eco RI fragment containing the maize SCR (ZCR) sequence was 
subcloned and sequenced. The nucleotide sequence of the 
region containing a partial ZCR coding sequence is shown in 
FIG. 17A and the corresponding deduced amino acid sequence is 

10 shown in FIG. 17B. The ZCR protein contains a segment that 
is highly homologous to a corresponding segment in the 
Arabidopsis SCR protein (FIG. 17B) . This segment is flanked 
by segments of low homology. Thus, it is possible that the 
genomic clone of ZCR is a composite clone, containing 

15 sequences that are not ZCR sequences. 

The deduced ZCR protein sequence was aligned with 
that of Arabidopsis SCR protein. The comparison revealed new 
conserved sites in the SCR coding sequence which were used to 
design new, more specific PCR primers (i.e., IF, 1R, and 4R) 

20 for use in amplification of SCR sequences from yet other 
plant species. 

Using combinations of primers 1F+1R and 1F+4R, 
PCR amplification was performed as described in section 
5.1.1.. Two DNAs of expected size were obtained from 

25 soybean: a 24 7 bp DNA from the 1F+1R primer combination and 
a 3 79 bp DNA from the 1F+4R primer combination. A DNA of 
expected size (24 7 kb) was obtained from carrot and spruce 
when their genomic DNA was amplified using the 1F+4R primer 
combination. The nucleotide sequences of the 379 kb soybean 

30 DNA (SCLgl) , the 247 kb DNA from carrot (SCLdl) and spruce 
{SCLpl) are shown in FIGS. 16K-M. The corresponding deduced 
amino acid sequences of these amplified sequences are shown 
in FIG. 18. Comparison of these partial SCR coding sequences 
indicate this approach isolated DNA sequences that encode SCR 

35 proteins with amino acid sequences that are very similar, but 
not identical, to a segment of Arabidopsis SCR protein (see 
FIG . 18) . 
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10. EXAMPLE 5. EXPRESSION PATTERN OF MAIZE ZCR GENE 
IN ROOT TISSUE ; 

These experiments examined the expression pattern 
of ZCR in the primary root and quiescent centers of maize 
root. The expression pattern was determined by in situ 
hybridization using a ZCR RNA probe, corresponding to an 
amino acid segment region that is highly homologous to a 
corresponding segment of the Arabidopsis SCR protein. The 
experiment was carried out as follows. Restriction fragments 
containing the maize ZCR sequence were isolated from pZCR and 
subcloned into a pBluescript vector for in vitro 
transcription. The probe was synthesized using conditions 
described in the Genius Dig RNA labeling kit. The 
pBluescript plasmid was linearized, and 1 pg was used as a 
template to synthesize digoxigenin-labeled RNA using the T7 
polymerase. The RNA probe was subjected to mild alkali 
hydrolysis by heating at 60°C for 1 hr in 100 mM carbonate 
buffer (pH 10.2) to yield a probe size of approximately 0.15 
kb. Probe concentration for hybridization was optimized at 1 
/zg/ml/kb. In situ hybridization of root tips from 48 to 72 
hr-old maize seedlings or excised quiescent centers (QCs) of 
roots were carried out following procedures described in 
Section 6.1.6., supra. 

The results show that ZCR expression in maize 
primary roots is localized to a file of cells that is 
identified as the endodermal layer. The expression pattern 
continues in a single uninterrupted file through the QC which 
consists of approximately 1000-1500 cells (FIG. 22) . 

In two- week old regenerating QCs, ZCR expression 
is found in a file of cells extending through the newly 
formed apex. Thus, the regenerated roots exhibit a ZCR 
expression pattern that is similar to that seen in the 
primary root, even though the root apex does not contain the 
•normal arrangement of cell files at this stage. 

ZCR expression during regeneration of the root 
apex also was examined. In the initial stages of 
regeneration, cell proliferation occurs to fill in the 
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removed tissue and begins to regenerate the basic shape of 
the root tip. All cells on the blunt edge of the root appear 
to contribute to the new population of cells. The ZCR 
expression pattern indicates that molecular signals are 
5 differentially present in these cells at an early stage in 
regeneration. The gene appears to be diagnostic of cells 
that are preparing to undergo asymmetrical division in order 
to re-establish the normal organization of the root apex from 
the large undifferentiated cells. The results indicate that 
10 ZCR expression is required for pattern formation since it is 
expressed prior to the generation of any specific anatomical 
pattern in the newly formed root tissue. 

11. EXAMPLE 6. EXPRESSION PATTERN OF SCR 

15 GENE IN SOYBEAN ROOTS AND ROOT NODULES 

SCR expression in soybean roots and nodules was 

examined using in situ hybridization with a SCR probe. The 

procedures used were as described in Sections 6.1.6. and 10. 

In primary roots, SCR is expressed in the 

2Q endodermis. Expression was found also in cells at the root 
tip that are located at the distal end of the endodermal cell 
files. In soybean nodules, expression of SCR was detected in 
the peripheral tissue at the site of developing vascular 
strands. At later stages of vascular development within the 

25 nodule, SCR expression was found flanking the vascular 
tissue. These results indicate that SCR is involved in 
regulating vascularization in the nodule by contributing to 
the radial organization that is required to generate 
endodermis. These findings indicate that the SCR promoter 

3Q may be used to express proteins in a highly tissue-specific 
manner in soybean nodules. One application is to use the SCR 
promoter to engineer nodules through production of components 
in a tissue-specific manner. Another application is that 
modification of the expression of SCR could enhance nodule 

35 activity by improving vascularization and/or the number of 
endodermal layers . 
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12. EXAMPLE 7. SCR EXPRESSION AFFECTS 

GRAVITROPISM OF AERIAL STRUCTURES 

In addition to being defective in specific 
embryonic and postembryonic meristematic divisions, both the 
5 scr and the shr mutants have shoots that exhibit severely 
defective gravitropism. Complementation analysis showed that 
scr is allelic to a sgr (shoot gravitropism) mutant, sgrl. 
Four mutant alleles of SCR (i.e., scrl, scr2, sgrl-1 and 
sgrl -2) have been identified. All four of these mutants have 
normal root gravitropism and defective shoot gravitropism. 

Etiolated hypocotyls of scr mutants placed on 
their sides do not respond to gravity even after 3 hr. 
Similar behaviors were observed with the inflorescence stems 
of sgrl-1 mutant, which do not curve upwards even after two 
15 days on their sides. In contrast, the roots of these plants 
respond rapidly to the change in orientation with the same 
kinetics as the wild type. Thus, mutations in the SCR gene 
lead to a radial pattern deficiency in the root but have no 
effect on root gravitropism. 
2q Comparable results were obtained also for shr 

roots and for hypocotyls and inflorescence stems, i.e., data 
indicate that shr shows normal root gravitropism but almost 
no stem gravitropism. 

25 13. EXAMPLE 8. MAIZE ZCR GENE 

This example describes the cloning and expression 
pattern of the maize ZCR gene, an ortholog of the Arabidopsis 
SCR gene . 

30 13.1. CLONING THE MAIZE ZCR GENE 

In order to clone the maize ortholog of the 
Arbidopsis SCR gene, a reverse genetic technology strategy 
was utilized. With this strategy, it is possible to clone 
genes from across taxonomic boundaries, such as from genes 

35 identified in model organisms like Arabidopsis to those 
embedded in more complex genomes such as maize. 
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More specifically, using the deduced amino acid 
sequence of the Arabidopsis SCR gene in the reverse genetic 
technology strategy, multiple maize EST sequences related to 
SCR were isolated. One of them appeared very homologous to 
5 SCR, having greater than 77% sequence identity. 

Using this highly homologous EST sequence as a 
probe, three genomic clones from a B73 inbred maize genomic 
library were isolated. Based upon restriction enzyme 
analysis, the three genomic clones appeared to be overlapping 

10 portions of the same genomic region. 

Subsequently, a 5kb Sail fragment from one of the 
three clones was subcloned into pBluescript SK(-) and 
sequenced. The sequence analysis of the cloned maize gene 
revealed that it consists of two exons and one intron in one 

15 open-reading frame (ORF) encoding 668 amino acids. The 
presence of an in- frame stop codon located 5' to the 
initiating ATG and nearby stop codons in the other two 
reading frames suggests that the long ORF of this genomic 
clone encodes the functional, full length protein. See, FIG. 

20 25. 

After obtaining the full length maize sequence, a 
database search was performed to find homologous sequences. 
The database search revealed that the newly isolated maize 
sequence was most homologous with the Arabidopsis SCR gene at 

25 the amino acid level. Comparison of the maize and 

Arabidopsis sequences indicated that the similarity between 
the Arabidopsis SCR and the maize ZCR gene extended beyond 
the VHIID domain into both the N- and C-termini (FIG. 26) . 
Although the N-terminal region of the maize ortholog and the 

30 Arabidopsis SCR gene appears more divergent, the maize ZCR 
gene has the homopolymeric stretches characteristic of SCR 
(Gerber et al . , 1994, Science 263:808-811; Johnson, et al . , 
1993, J. Nutr. Biochem. 4:386-398). 

In addition, the ZCR gene has other motifs 

35 characteristic of SCR: two putative leucine heptad repeats, 
which have been shown in other proteins to mediate 
protein-protein interactions; and a stretch of basic residues 
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similar to the basic domain of bZIP proteins, which have been 
shown not only to mediate DNA-binding, but also nuclear 
localization (Hurst, H.C. , 1994, Protein Prof. 1:123-168). 
Moreover, the ZCR gene has three copies of an LXXLL motif in 
5 the N-terminal region, which has been shown to mediate the 
binding of a steroid receptor coactivator complex to nuclear 
receptors (Heery, et al . , 1997, Nature 3 87:733-736; Torchia, 
et al., 1997, Nature 387:677-684). See, FIG. 26. Similarly, 
the GAI and RGA gene products also contain a copy of this 

10 sequence. In these genes, the sequence is believed to be 
involved in a gibberellin signal transduction pathway (Peng, 
et al., 1997, Genes Dev. 11:3194-3205; Silverstone, et al., 
1998, Plant Cell 10:155-169) . 

Although the functionality of these putative 

15 motifs has not been clearly demonstrated, the fact that all 
of these putative motifs exist in a single polypeptide 
strongly suggests that the maize ZCR is a transcription 
factor similar to the Arabidopsis SCR gene. In addition, the 
structure of the ZCR gene is very homologous to that of the 

20 SCR gene. Specifically, the position of the intron is 

conserved, although the size and sequence of the intron is 
different in the two genes. 

In addition to the maize ZCR gene, a 3.2kb 
fragment upstream of the initiating ATG of the maize gene was 

25 isolated. This region, similar to numerous other upstream 
regions in other genes, likely contains regulatory elements 
of the ZCR gene. Furthermore, this upstream region can be 
analyzed and utilized similar to the upstream region of the 
SCR gene, discussed supra. 

30 FIG. 32 shows an RNA blot analysis in which 

either total RNA or poly-A selected RNA from roots and shoots 
were probed with the full-length ZCR cDNA. As shown in the 
figure, the probe hybridized to a band that is approximately 
2.6 kilobases in size. 

35 FIG. 33 shows the partial nucleic acid and amino 

acid sequence of CBPBT44, a gene which has significant 
homology to both the Arabidopsis SCR and the maize ZCR genes. 
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FIG. 34 represents an alignment of the three genes. As shown 
in FIG. 34, the three genes share a high degree of homology, 
including, but not limited to, the leucine heptad repeats. 
To further demonstrate the homology between the maize ZCR 
5 gene and the CBPBT44 partial sequence, a Southern blot 

analysis was performed. See, FIG. 35. FIG. 35 demonstrates 
that CBPBT44 (right pane, lane C) is the source of some of 
the bands picked up by the maize ZCR cDNA (right panel, lane 
A) . Thus, it is likely that CBPBT44 is a closely related 
10 gene to the ZCR gene, and that CBPBT44 may represent a 

duplicated copy of the maize ZCR gene in the maize genome. 
This possibility is strengthened by the fact that maize is 
thought to have undergone a general duplication of its genome 
during its evolution. 



13.2. EXPRESSION PATTERN OF THE MAIZE ZCR GENE 
20 In order to understand the function of the maize 

ZCR ortholog, the expression pattern of the maize ortholog 
was examined in various types of roots, including, but not 
limited to, the maize primary, embryonic, lateral, seminal 
lateral and adventitious roots by RNA in situ hybridization. 
25 Suprisingly, in spite of the profound differences of the root 
architecture between maize and Arabidopsis (FIG. 23) , the 
expression pattern of the maize ZCR is remarkably similar to 
that of the Arabidopsis SCR in that expression is found only 
in the endodermis cell lineage (Fig. 22A-C) . Furthermore, it 
30 is expressed in the embryonic root and lateral root (FIG. 
22D-F) . 

Interestingly, ZCR expression also was found to 
extend through the QC (FIG. 22A-B) . Expression through the 
QC was confirmed by observations of the expression pattern in 
35 serially cut sections. This demonstrates the first evidence 
for cell -specific expression within the QC, which has long 
been considered to be undifferentiated and probably 
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multipotent, analogous to stem cells in animals (Barlow, 
P.W., 1976, J. Theor. Biol. 57:433-451; Barlow, P.W., 1978, 
In Stem cells and tissue homeostasis (Lord, B. I., Potten, C. 
S. and Cole, R. J. eds) , (Cambridge: Cambridge University 
5 Pess) ) . In addition, this finding raises the possibility 
that radial organization is established in the mitotically 
inactive narrow region where cell files converge. 

14. EXAMPLE 9: MAIZE ZCR GENE EXPRESSION 

10 DURING REGENERATION OF THE ROOT TIP 

This example describes the expression of the 
maize ZCR gene during regeneration of the root apex after 
excision of the QC. Expression after removal of the root cap 
and immediately after QC excision did not show any alteration 

15 in its pattern (FIGS. 27A-B) . 

At 24 hours after removal of the QC, the excised 
tissue began to be replaced, reforming the basic shape of the 
root tip. Expression was found in the endodermal cell file 
of the unexcised portion of the root as well as in the newly 

20 formed cells at the base of the endodermal cell files. The 
lack of its expression in the cells below this region 
indicates that it is activated only after initial 
proliferation and partial restoration of the apex. Moreover, 
expression was found also in isolated cells located between 

25 the cell files (FIG. 27C) . Examination of serially cut 

transverse sections indicated that these internal cells were 
not directly adjacent to any other cells expressing the gene 

lineage requirement for the isolated cells expressing the 
30 maize ZCfi gene. 

At 46 hours after excision of the QC, expression 

of the maize ZCR was found in a band of cells that is nearly 

perpendicular to the base of the endodermal cell files (FIG. 

27E) . At this stage, the root tip had regained its normal 
35 external shape, although longitudinal sections show that the 

cell files are not organized into the converging files seen 

in the normal root anatomy. 
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At 72 hours, the expression of the maize ZCR gene 
pattern resembled that found in the unexcised root, although 
the anatomical pattern was not yet restored (FIG. 27F) . 
Between 72 and 96 hours, there was an anatomical shift such 

5 that files became convergent at the tip. Finally, by 96 
hours following excision of the QC, ZCR gene expression was 
found to be localized to a single file of cells extending 
through the tip in a manner similar to that seen in the 
primary root (FIG. 2 7G) . 

0 These results show that the expression pattern of 

the maize ortholog converges at the root tip prior to the 
anatomical pattern of the root. Thus, ZCR gene expression 
prepatterns radial organization of the root. The progressive 
refinement of the expression pattern suggests that radial 

5 patterning is regenerated by processes that involve 

positional information possibly transmitted through cell -cell 
signaling within the regenerating region. 



15 . DEPOSIT OF MICROORGANISMS 
0 The following microorganisms have been deposited 

in accordance with the terms of the Budapest Treaty with the 
American Type Culture Collection; 12301 Parklawn Drive, 
Rockville, MD 20852, U.S.A., on the dates indicated: 



5 



Microorganism 



DH5ot 



Accession 

No. 



98031 



Date 
April 26, 1996 



DH5a 
DH5ct 
DH5a 
DH5ot 



pNYHl (Zm-scllb) 

pNYH2 (Zm-scll) 

pNYH3 (Zm-scl2) 
pZCR 



98032 
98033 
98034 
97992 



April 26, 1996 

April 26, 1996 

April 26, 1996 

April 18, 1997 



Although the invention is described in detail 
with reference to specific embodiments thereof, it will be 
5 understood that variations which are functionally equivalent 
are within the scope of this invention. Indeed, various 
modifications of the invention in addition to those shown and 
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described herein will become apparent to those skilled in the 
art from the foregoing description and accompanying drawings. 
Such modifications are intended to fall within the scope of 
the appended claims . 

Various publications are cited herein, each of 
the disclosures of which is incorporated by reference in its 
entirety. 
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WHAT IS CLAIMED IS : 

1. An isolated nucleic acid molecule comprising a 
nucleotide sequence that encodes a SCARECROW protein 

5 containing an amino acid sequence substantially similar to 
the sequence of MOTIF III (VHIID) of Arabidopsis SCR protein 
shown in FIGS. 13A-F. 

2 . An isolated nucleic acid molecule comprising (a) a 

10 nucleotide sequence that encodes a scarecrow protein having 
the amino acid sequence of SEQ ID N0:2, SEQ ID NO: 19, SEQ ID 
N0:21, SEQ ID NO:23, SEQ ID NO:34, SEQ ID NO:35, SEQ ID 
NO: 36, SEQ ID NO: 37, SEQ ID NO:41, SEQ ID NO: 42, SEQ ID 
NO:43, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID 

15 NO: 50, SEQ ID NO: 52, SEQ ID NO: 54, SEQ ID NO: 56, SEQ ID 
NO:58, SEQ ID NO:59, SEQ ID NO:61, SEQ ID NO:63, SEQ ID 
NO: 65, SEQ ID NO: 67 or the amino acid sequence shown in FIG. 
25, FIG. 28AB, FIG. 28AC, FIG. 28AD, FIG. 28AE, FIG. 28AF, 
FIG. 28AG or FIG. 28AH; or (b) the complement of the 

20 nucleotide sequence of (a) . 

3. An isolated nucleic acid molecule comprising a 
nucleotide sequence that hybridizes to the nucleic acid of 
Claim 2 and encodes a naturally occurring SCR gene product. 

25 

4. A nucleic acid molecule comprising (a) a nucleotide 
sequence that encodes a SCR protein lacking one to four of 
the following motifs delineated in FIGS. 13A-F: MOTIF I, 
MOTIF II, MOTIF III, MOTIF IV, MOTIF V, or MOTIF VI; or (b) 

30 the complement of the nucleotide sequence of (a) . 

5. A nucleic acid molecule comprising (a) a nucleotide 
sequence that encodes a polypeptide corresponding to MOTIF I, 
MOTIF II, MOTIF IV, MOTIF V or MOTIF VI of the SCARECROW 

35 protein delineated in FIGS. 13A-F; or (b) the complement of 
the nucleotide sequence of (a) . 
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6. The isolated nucleic acid molecule of Claim 1 comprising 
the nucleic acid sequence of SEQ ID N0:1, SEQ ID NO: 18, SEQ 
ID NO:20, SEQ ID N0:22, SEQ ID NO:45, SEQ ID NO:47, SEQ ID 
NO:49, SEQ ID NO:51, SEQ ID NO:53, SEQ ID NO: 55, SEQ ID 

5 NO: 57, SEQ ID NO: 60, SEQ ID NO: 62, SEQ ID NO: 64, SEQ ID NO: 66 
or the nucleic acid sequence shown in FIG. 25, FIG. 28A, FIG. 
28B, FIG. 28C, FIG . 28D, FIG. 28E, FIG. 28F, FIG. 28G, FIG. 
28H, FIG. 281, FIG. 28J, FIG. 28K, FIG. 28L, FIG. 28M, FIG. 
28N, FIG. 280, FIG. 28P, FIG. 28Q, FIG. 28R, FIG. 28S, FIG. 
10 28T, FIG. 28U, FIG. 28V, FIG. 28W, FIG. 28X, FIG. 28Y, FIG. 
28Z or FIG. 28AA. 

7. A DNA vector containing the nucleic acid molecule of • 
Claim 1, 2, 3, 4, 5, or 6. 

15 

8. An expression vector containing the nucleic acid 
molecule of Claim 1, 2, 3, 4, 5, or 6 operatively associated 
with a regulatory sequence containing transcriptional and 
translational regulatory elements that control expression of 

20 the nucleotide sequence in a host cell. 

9. A genetically-engineered host cell containing the 
nucleic acid molecule of Claim 1, 2, 3, 4, 5, or 6 . 

25 10 . A genetically-engineered host cell containing the 
nucleic acid molecule of Claim 1, 2, 3, 4, 5, or 6 
operatively associated with a regulatory sequence containing 
transcriptional and translational regulatory elements that 
control expression of the nucleotide sequence in a host cell. 

30 

11. An isolated SCARECROW protein. 

12. The protein of Claim 11 having the amino acid sequence 
shown in FIG. 25. 

35 
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13. A SCARECROW protein lacking one to four of the following 
motifs delineated in FIGS. 13A-F: MOTIF I, MOTIF II, MOTIF 

III, MOTIF VI, MOTIF V, or MOTIF VI. 

5 14. A polypeptide corresponding to MOTIF I, MOTIF II, MOTIF 

IV, MOTIF V or MOTIF VI of the SCARECROW protein as 
delineated in FIGS. 13A-F. 

15. An antibody that immunospecif ically binds the protein or 
10 polypeptide of Claim 11, 12, 13 or 14. 

16. An anti- idiotypic antibody that mimics an epitope of 
SCARECROW protein. 

15 17. A plant genetically-engineered to overexpress or 

underexpress a SCARECROW protein or polypeptide, so that cell 
division is modified, and root and/or stem development is 
altered. 

20 18. A plant genetically-engineered to overexpress a 

SCARECROW protein or polypeptide, so that cell division is 
increased in roots, resulting in thicker root development. 

19. A transgenic plant containing a transgene having the 
25 nucleic acid molecule of Claim 1, 2, 3, 4, 5, or 6. 

20. A transgenic plant containing a transgene having the 
nucleic acid molecule of Claim 1, 2, 3, 4, 5, or 6 
operatively associated with a regulatory sequence containing 

30 transcriptional and translat ional regulatory elements that 
control expression of the nucleotide sequence in a transgenic 
plant cell. 

21. The transgenic plant of Claim 19, in which the transgene 
3 5 encodes an antisense nucleotide sequence that suppresses 

expression of endogenous SCARECROW gene product, so that cell 
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division is decreased in roots, resulting in thinner root 
development . 

22. A genetically-engineered plant in which the endogenous 
5 SCARECROW gene is disrupted or inactivated so that cell 

division is decreased in roots, resulting in thinner root 
development . 

23 . A transgenic plant containing a transgene encoding a 
10 gene of interest operatively associated with a SCARECROW 

promoter, so that the gene of interest is expressed in a 
tissue-specific manner in roots or embryos. 

24. The transgenic plant of Claim 23, in which the gene of 
15 interest encodes a gene product that confers herbicide, salt, 

pathogen, or insect resistance. 

25. A transgenic plant containing a transgene encoding a 
gene of interest operatively associated with a SCARECROW 

20 promoter, so that the gene of interest is expressed in 
shoots . 

26. The transgenic plant of Claim 25, in which the gene of 
interest encodes a gene product that increases starch, lignin 

25 or cellulose biosynthesis. 

27. A plant genetically-engineered to overexpress or 
underexpress the SCARECROW protein so that gravitropism of 
the stem or hypocotyl is altered. 

30 

28. The plant of Claim 27, which is less susceptible to 
lodging than a wild-type plant. 



35 
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i o t s t ft X 




ccroc<o«<«n«a^cciooc*crc»coo3Acr^ 

GCtTX>CCTCAfiAAACTT0CXAACTTCCACAClCACA<£ACTCAA1 



t r 0 »« 



C « 0 (. L X C 

rAAAO7r*r*TCTTT*0C10S»»JkT0C»OCTAC 
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T-DNA T-DNA 
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SCR bZIP-like domain 

GCN4 (yeast) 

TGA1 (Arabidopsis) 

C-Fos (mouse) 

c-JDN (human) 

CREB (human) 

Opaque-2 (maize) 

OBF2 (maize) 

RAF— 1 (rice) 



PAVQTNTAEALRERKEEIKRQKQ D 

II I I I =111 

LKRAROTEAARRSRARKLQRMKQ L 

RRLAQNREAARKSRLRKKAYVQQ L 

IRRERNKMAAAKCRNRRRELTDT L 

RKRMRNRIAASKCRKRKLERIAR JL 

VRLMKtniEAAIlECTRKKKEYVI^Cr * L 

KRKESNRESARRSRYRKAAHLKE L 

MRQIRNRDSAMKSRERKKSYIKD L 

RRMVSNRESARRSRKKKQAHLAD L 



F I <5, SC. 
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SCR.VHXXD domain 

1 

SCR J^EKEDSVHICTLDXMQGIjQWPGtiFHXlASREGGPPHVRI.TGI, 

F13 89 6 AVKNESFVHIIDFQISQGGQWVSI-IRAI.GARPGGPPNVRITCI 

Z37192 AMEGEKMVHVIDLDASEPAQWLALLQAFNSRPEGPEHLRITGy 

Z2 5 6 4 S AIKGEEEVHIIDFDINQGNQYMTr.IRSXA 

D41474 IHVIDFXLGVGGQWASFLQEXAHRRG 

T183 10 VHIIXFXLMQGLQWPALMDVFSARKGGPPKLRITGX 
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KetAXaciuserGlyAspPhcAcnClyClyClnProProProHlsserProDeuArgTht 
VtirSerSerClySerSerSerSerAsnAsnArgGlyproProProProProPcoPifoPro 
I<euValKetVa3LAr9LyeXrgL«uAlaSerCluMet2erSerXGnProAspTyrAsnA6t> 
SerSerJU^ProProAryArgValsetlfisIieur^uAspSerAsnTyrAsnThrVaXThr 
ProoinGlnProProSttLeuXtu:M.aAlaM.aTbrValSers«r(:inPro\snProPr(> 
I^uScrValcyGGlyPheSciXJlylAJuProVairUeProScrAspAxgGlyGlyArvXsn 
ValKetMetSet^alGlnProMet&spGlnXspserSerSerSerSerAlaSerPftiThx: 
ValTrpValA«pXlaIleilcAr^A«pl^uIlclll^SerSerthrSerValSerIlcPro 
GXnLeuXXeGXnAsnVaXArgAspXXeXXePheProCysAsnPro\snLeuGXyAXaI<eu 
X<euGluTyrAr9lAUArgSerX«uKetXfeuL«uAspPraSerSecSerSerAspPraSer 
ProcinThrPticGluProt«uTyrGlnXieScrXsnXsnProSerPr«ProCXnGlnCiLn 
ClncinHiEGlnGXnGlnGlnGlnGlnKisLysProProProProPcoXleGLnGlnGln 
GXuRr^rGXtiAsnSerSerThrAspAXaProProGXnPiroGXuThrValThrXXaThrVaX 
4>roAlaVaXCXnThrAsnThrAXaGXuAXaL«uArgrGXUAr9l,ysGXuGXuXXeI,ysArg 
GXnx.ysGXnAspGluGXuGXyLcuHisLeuX>euThrI<euX«uLeuGXnCysAXaGXuA.Xa 
VaXserM.aAspA8ni^uGXuGXuAlaAsnL.yeLeuLeuLeuGXuXXeSerGXnLeuSer 
ThrProTyrGlyThrScrAXaGXnArgVaXMaAlaTyrPheSerGlviAXaKetSerAXa 
XrgL^ul^uXsnSercysLcticiyiXeTyrXlaXXal^suProserArgTrpHetProcin 
ThrHisSerl^euLysMetVaXScrAlaPheGlnValPheAsnGlyXIeSerProLeuVaX 
LysPheSerHisPhcThrAlaAsnGlnAXalleGlnGloXlaPheCluLysGluAspSer 
ValHtsIXelXeAspLeuAspIXcHctGlnGXyLeuGlnTrpProGliyLcuPheHisIXe 
LcuAXaSerArgProGlyGlyProProHisValArgLeuThrGlyLeuGlyThrScrMct 
cXuAXalMSUCXnAlaThrClyLysArgLeuSerAspPheThrAspLysLcuGlyLeuPro 
Ph«GXuPheCysPraLeuAXaGluLysValGlyAsnI<euAspThrCl.uArglJeuAsnVal 
ArgLysArgGXuAXaVaXAXaVaXKisTrpLeuGXnHlsSerL«uTyrAspVaXThrGXy 

SerAspAXaHisThrLeuTrpLeuX^uGXnArgLeuAlaProLysValVaXTUrVaXVaX 
GXuCXnAspI^uS«rKisAXaGXySerPheI>uGXyArgPhcVaXCluAl.aXleHtsTyr 
TyrSerAXat«uPheAspSerr^uGlyAXaS«rTyrtlyGluGluSerGluGluArgHis 
VaXVaXGXuGXnGlnLeuLeuSerl.ysGXuIleArgAsnValLeuAlaVaXGlyGlyPro 
SerArgScrGXyCluVall.ysPh«5GluScrTrpArgGXuLysHetGlnGJ.nCysGlyPhe 
X.yscXyIleSerL«uAlaGlyAsnAlaAlaThcGlnAlaThrLeuLeuLcuGly«etPhe 
ProSerAspGlyTyrThrl^uValAspAspAsnGlyThrLcuLyELcuGlyTrpLysAsp 
LeuSeirL«uLeuThrAlaSerAlaTrpThcPcoArgSerSTOP 
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WT scr-1 scr-2 
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10 20 30 40 SO 

1234567890 1234567890 1234567890 1234567890 1234567890 

G3CkQ3AiXC CAA0333ICC AOFEAIAIQC ATKTCITCIA SO 

GTSP TGP E If L TYMH I I* Y 

OLTiml'iUft. AKi'iUJai'iA AfclGGRGCIA 100 

E A C P Y F K FGY E S A N G A I 

T&GCTCft&X: GAASGTITJG TO3£2H"I?tT aSTITOCftG ISO 

A E A VKN ESFV HII DFQ 

ATTTCICAftG GIC33ICAKD3 G3IGAG1TIG ATO33IGCTC TlU3iU2ERG 200 
ISQG GQW V S L I R A L GAR 

A0CIG3IG3A OCTO0GAAO3 TEAQ3&IA&C GQSaKETdiVr GKTOOGSGKT 250 
PGG PPNV RIT GID DPRS 

Cm03mB2 TEGTCAAGSA G3ACTIGAGT TAGTIGGACA A£GACTTG33 300 
SFA RQG G L E L V G Q R I* G 

AAGCEAG3CG AAft2GIQ333 TCrriXXXSTIT GSGITCCATC GAGCTOCTIT 350 
K £i A E MCG V PF EFHG A A L 

ATQZIQCAQ3 GAAGflOSAAA AOSftGTEAGA. AOT33&GAAG 400 

CCT EVEI EKL GVR NGEA 

0QCT03033T T&ACTIODOG CTlUTiLTIC AOCPOWSOC TGAIGAGAGT 450 
LAV NFP L V L H HMP DES 

GTAACIGIQ3 AGAATCACfiG AGA3SGATIG TJSMSmXSG TCAAACACIT 500 
VTVE NHR D R L L R L V K H L 

GTCA03AAC GTB3K3&CIC T33TIGftGCA AG&ftQ33\KT ACAAACACIG 550 
SPN VVTL VEQ EAN TNTA 

CQCXETTTCT TXXD33GrTTT GIU3&GACAA TGAADCATT&. CTIGSZftGTT 600 
P F L PRF VETM NHY LAV 
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10 20 30 40 50 

1234567890 12345678 90 1234567890 1234567890 1234567890 

*IY\GMGIGRA ££TB33Cm^ GATCZO\AG3 650 
FESI DVK LAR DHKE RIN 

TCTIGAQCAG C&FB3ITIG3 CffiC3fti3«33r TCIGAATCTT ATRG Ori Gi U 700 
V E Q H C L A R E V V N L I A C E 

AAG3IGTIGA. A£GA£3l&^G AG3CAOGAGC 750 
GVE REE RHEP LGK WRS 

OG2TITCACA. TkA&XGlSfl? OCTTIGaGCT CCTEVTCT^A 800 

R FHM AGF KPY PLSS Y V N 

OQGAACAKrC A&AG3fiTK3C TIS^Gim-TIC2V3^fiG 3?0?0£TIG 850 
ATI KGLL ESY SEK YTLE 

AAGAAftG&Gft. TOGJ^GCKETG T&CTTAGSaT G3&AGAATCA AXlCrEVIC 900 
ERD GAL Y L G W KN Q PLI 

ACTiCTlGIG CTTQ3AOGm AOCITCTKG 950 

T S C A W R X 

AGarraGAAA crrcirmA agttig^aga ArciGrrror amagtataa 1000 



CICKEGCfilG AKXENAG3A ACA£GT1G'J.C AMIGTICTft. CTAGmfiGIG 1050 



ATRTOTIGST GACCCAAMA. A&AAAAAAAA. AAAA7V 1085 
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10 20 30 40 50 

1334567890 1234SS7flQ0 17.34567890 1234567890 1234567890 

GAGAGAAGAT GoI'XCKDSIG AT1GATCIOG A1G2TICIGA 50 
A M E G EKM V H V IDLD A S E 

GaCKGCXCAA TX32ITCa&GC TTITWOGT AQQDCHS&AG 100 

P A Q WLAL LQA F N S R P Et G 

GTCX3£CTCA TITCAG&AIC iOK33IUTIC iYICACCAGAA Q3AAGIQ2TT 150 
P P H LRI TGVH HQK EVL 

CICai&G&CT GCAGAGAAAC 2D3&IKICQC 200 

EQMA HRL IEE AEKL DIP 

GITICAGITr MICDOGTTG T3AGI&G3IT AGJOGiTTA. N<K*IP£Z&£2 250 
FQF. NPVV SRL DCD NVEQ 

AGTIGCG33r TAAAACAQ3A. GM33XmG 033TEAQCTC G3TICTICAA 300 
L R V KTG E A L A VSS VLQ 

TiQ^AiAarr tctiqxcic tc^igktgkt CICAIG&GAA. 350 

I* H T F LAS DDD L.MRK NCA 

TTEPiCGGTIT CAGAACAAOC CTAGraSftCT TC&CTIGCAG 400 
LRF QNNP SGV DLQ RVLM 

TCKIGAGDCA. GdG?iG3^C GIG^GAMGA. TMG^TmAC 450 

MSH GSA AEAR END M SN 

AACAKT93GT ATAG30CEAG 033IGACI03 GOl'lLmC'lT TOOCTlTftCC 500 
NNGY S PS GDS ASS L P L 

AAGTICAG3A. GCTCCCICAA TGL'IMTIQS Q3ITJLUUJ1C 550 

SSG RTD S FLN AIW GLSP 

C&AAG3ICAT G3IQGTCACT GAGCPAGACT CAGADCACAA 033ZDXACA 600 
KVM VVT EQDS DHN GST 



WO 00/53723 



17/101 



PCT/USOO/05875 



10 20 30 40 50 

1234567890 1234567890 1234557890 1234567890 1234567890 
CTftKEGSaGk ATCSO.T12AC C^ITSITIGA 650 

LMER LLE SLY T Y A A If F D 

TD3CTIG3AA. ACZ^&GTIC CaRGaAOSIC 0X3&GMDaG3 ATCaMGIGS 700 
CLE TKVP R T S Q D R 1 K Vt E 

AS^GATOCT CTI03333?iG G£G&IC&&GA ACMCAfmOX: 750 
KML FGE EIKN IIS CEG 

G&3&&W3&CA QSSiSREGCIET GAG&A&EQSl QOCftGBGG&T 800 
FERR ERH EKL EKWS QRI 

CX3KETIQ32T G3TITIQ33A. ATCITOdCT TftSCT&JEEftT GOGATOTIGC 850 
DLA GFGN VPL SYY A M D Q 

AGGCIftGGftG AT1UCT1CAA G33IG333TT TIG&D3330A 0&G?&TCft&G- 900 
ARR L Ij Q GCGF DGY RIK 

G3IQ332RGT AKiTlUL'lU3 C&&GAT03&C CICTMCACTC 950 
EESG CAV ICW QDRP LYS 

G3TA3CAGCT 1G3&GA3G2A. GGAAGIGAAT GKEAIftTEAC AQITIGICIT 1000 
VSA WRCR KX' 

CTATTITOGir TAIGAGGAGA. GTCOCTTICT TllTJLUlSflA CAIQ333CA 1050 



o^Krcmsr tctttigiga rasiGacrrr ckjicictit atozjuAlItit 1100 



QSCrmAAIG CTICERCIG^ CTCIQ2&D3T A^AGOCITIG TCTGTIQ3IT 1150 



CAATTIG3IC 1G3IGKX33T G02ATACCAA ACCAAA3TCA. ATTIGAQZTG 1200 



AAGMCARC3A ATTIGAIGKT 03321021X30 C 3231 
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CTTTGTCAA.T GGTAAATGAG CTGAGGCAGA TAGTTTCTAT CCAAGGAGAC SQ 
CCTTCTCAGA GAATCGCAGC TTACATGGTG GAAGGTCTAG CTGCAAGAAT 10 q 
GGCCGCTTCA GGAAAATTCA TCTACAGAGC ATTGAAATGC AAAGAGCCTC 1SQ 
CTTCGGATGA GAGGCTTGCA GCTATGGAAG TCCTGTTTGA AGTCTGCCCT 2 00 
TGTTTCAAGT TCGGGTTTTT AGCAGCTAAT GGTGCGATAC TTGAAGCAAT 2 SO 
CAAAGGTGAA GAAGAAGTTC ACATAATCGA TOTCGATATA AACCAAGGGA 300 
ACCAATACAT GACACTGATA jCGAAGCATTG CTGAGTTGCC TGGTAAACGA 350 
CCTCGCCTGA GGTTAACAGG AATTGATGAC CCTGAATCAG TCCAACGCTC 400 
CATTGGAGGG CTAAGAATCA TCAATCTAAG ACTCGAGCAA CTCGCAGAGG 4S0 
ATAATGGAGT ATCCTTCAAA TTCAAAGCAA TGCCTTCAAA GACTTCGATT 5O0 
GTCTCTCCAT CAACACTCGG TTGCAAACCA GGAGAAACCT TAATCAGTGA SS0 
ACTTTGCATT CCAACTTCAC CACATGCCTG ACGAGAGTGT CACAACAGTA 6 O0 
AACCAGCGGG ACGAGCTACT TCACATGGTC AAAAGCTTAA ACCCGCTTGT 650 
CACGGTCGTT GAACAAGACG TGAACACAAA CACTTCACCG T4TCTTTCCCA 700 
GATTCATAGA GGCTTACGAA TACTACTCAG CAGTTTTCGA.GTCTCTAGAC 750 
ATGACACTTC CAAGAGAAAG CCAAGAGAGG ATGAATGTAG AAAGACAGTG 800 
TCTCGCTAGA GACATAGTCA ACATTGTTGC TTGCGAAGGA GAAGAACGGA 8 50 

TAGAGAGATA CGAGGCTGCG GGAAAATGGA GAGCAAGGAT GATGATGGCT 900 
GGATTCAATC CAAAACCAAT GAGTGCTAAA GTAACCAACA ATATACAAAA 950 

CCTGATAAAG •CAACAATATT GCAATAAGTA CAAGCTTAAA GAAGAAATGG 1000 

GTGAGCTCCA TTTTTGCTGG GAGGAGAAAA GCTTAATCGT TGCTTCAGCT 1050 

TGGAGGTAAG ATAAGTGACA AG AG CAT ATA GTCTTTATGT TTCATAAAAC 11O0 

ATAATTATGT TTTTACTGTA ATCTTGGGTT ATTGTGTAAC TGGTTAAATC 1150 

ATCTCCATGT ATT ATT AC C A GAGGTTAGGG GTGATCACAG GTACTAAAAG 12 OO 

CTAATCTAAC ACTTATGGAA GAATTTTTCT TTCTTTTTTT TCCCTATTAT 1250 

ATAAAAATAA TTAGAGTTTT GGTTCTAAAC CTATTTGCTA AGTGTGAATG 13 OO 

AGTCTTTACA TGTTCATATT TCAGTTCAAA TGGTTAAATT TGTTAAGGTT 13 50 
CTCACTTAAA AAAAAA 
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Zm-scll 

10 20 30 40 50 

CCAGGAGGCGTTCGAGCG<JGAG<3AGCGTGTGCACATCATCGACCTCGA(^ 
QEAFEREERVHITDLD I 

60 70 80 90 X00 

TCATGCAGGGGCTGCAGTGGCCGGGC^^ 
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GOP P K L RITG I G. P NPI 
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AOOD33IG QGGMiaaCT' TCACTTPXA. G3^GICTCT 250 
HSV GIDF TFQ GVC VDQL 

TIGATftasrr GIGOS&CIGG ATOCTTC3CA AMT&ftlC&A A03AG?O3Cft. 300 
DRL CDW MLLK PI K GEA 

GTIQCOfflftA ACIOCftTOCT ACA&ClDCftT 03QCIOTI03 TIGAOOCfcGk 350 
VAIN S I L QLH RLLV DPD 

TGCPAbCCCA GTGGIGCCCG CAQCAKEAGA. TATOCTOCIC AAATIG3TCA. 400 
ANP VVPA PID ILL KLVI 

ICA&GKmAA OCG2A1G&1C TICA033TG3 TK^G2MK3A G3CAGATCAC 450 
KIN PMI FTV V EHE ADH 

A&CAG&OC&C CP£n2£IbGh GAQSTICACr AKIQCXCICr TOCZOMGC 500 
NRPP LLE .RFT NALF H Y A 

GAOCA3X3ITT GMTICTTIG3 AG3XATOCA TOSTTCEftCC 550 
TMF DSL E AMH RCT SGR D 

AOOTADCGA CTCACTCACA. GAG3IGERCC TI03&33IGft. GAITTTTGAC 600 
ITD SLT E VYL RGE IFD 
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H W R ERLT YAG LTQ VWFD 

A3CXXX3KIGA OiJrilafOOS CTZ^fiGRCC AGriGMULA TCflGACKKX: 750 
PDE VDT LKDQ LIH VTS 

eiUUSTICftft. OOCCTftGIG TCIGftSGGCA. GXTIGCZO? 800 
LSGS GFN ILV CDGS LAL 

AGCGIGGCftT AKKXSOCXDST TATAIGTOX A^CftGCTIGS TCTCIGftCftG 850 
A W H N R P L YV A TAW CVTG 

GAG3AAAT3C TGOCftGTICX: AT33IT33CA ACATCIGIAA G33320AAT 900 
GNA ASS MVGN ICK GTN 

GATAGfEAGAA GJWiGS^A. 0CCKIG3AGT AQC&QSftftGa. 950 

DSRR KEN RNG PMEX 

ATAAOCATCT CA3GAGCMA TCGftTCAAGT AATAAAMGC ACIGMtSOV 1000 



TaCftTOGIGA TCE^AASTTT TTITGCGIGA. ATCflGCRMG A03AMT3IT 1050 



C&ATITGAAT AfiOCTAATCA TGAGACICftA AAAA&AftAftA A&A 1093 
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CCCAACTTGG GAAGCCCTTC CTCCGCTCCG 
CTCCTCCTCG CACTCGCCGA CAGCCACCAT 
GCCGCTCGAC GTTGCCCTCA AGCTTGCAGC 
TGTCACCTGT GCTCCAGTTC ACTAACTTTSA 
ATGAGATTGG TGGCATGGCA ACTTCCTGCA 
CTCGGTGTTG GTGGTCAGTG GGCTTCCTTC 
CCGGGGAGCT GGAGGTATGG CCTTGCCGTX 
TGTCGACTGC TTCTCACCAT CCACTGGAGC 
CTCTCTCAGT TTGCCGCAGA GCTCAGAATT 
CAGTCTTGAT GCATTCAATC CTGCGGAATC 
AAGTTGTTGC TGTTAGCCTC CCTGTTGGCT 
CTGCCAGCGA TTCTTCGGTT GGTGAAACAG. 
GGCTATTGAT C 



CCTCCTACCT CAAGGAGGCC so 

GGCTCCTCCG GCGTCACCTC XOO 

ATACAAGTCT TTCTCTGACC 150 

CCGCAACAAG GOGCTTC5TG 200 

TCCATGTCAT TGACTTTGAT 2 50 

TTGCAGGAGC TTGOCC&qCG 300 

GTTGAAGCTC ACGGCTTTCA 350 

TGCACCTTAC CCAGGATAAC 400 

CCTTTCGAAT TCAATGCCGT 450 

TATTTCTTCC TCTGGTGATG 500 

GCTCTGCTCG TGCACCACCG 550 

CTTTGTCCTA AGGTTGTCGT 600 



Fig-. 12.A 



WO 00/53723 



23/101 



PCTAJS00/O5875 



Tnnwm tttttttttt tttttttttt tacagagcaa cagcagtata 50 

ATATTAATTC TGTACCACAC AACCATTTGA TAGGTTAAAT TACCCTCTAG 100 

TCTCTACTCA TAAGCAGTGT TTCCAATGAG ATGATCATGG CTAATTGAGC 150 

AGAGCATGGC AACAACCTAA AGCAACATCA TTAGCTAIAG AGACTGACAC 2 00 

CAATATTCCT AAATCCACTA GGCTAGCTAA TAAGCTGCAA CGAAAAGCAA 2 50 

TATGAAGAGT TCAACAGCTC AAGACAACAA TTTCATCTGC AACATTTAAT 3oo 

TGCAAGAATA AATGGACATT ACTGGAGTGG TCGATGCTXG CAAACGGTGG 350 

TGGAACCTTG GTGGAGTGAA GCTTATGGCT GATCAGCACC GCCAAGATGA 400 

TATGGATACA AGCTCCCCAC GCTGCCAGTA GAGCGTAAGA GCAGCTCCGC 450 

GTTTCTCCAC ATGGAATCCT CGGACCTGCA CCCGCTTCAG GAGGCAGTCT 500 
GC 
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FIG. 15C 



Scr 

3989 

12398 

4871 

11846 
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3989 • 
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11846 

2S04 . . . i 
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23196 CMFHDAIALQ AAEKSLVEM. GEKDPSSSSA SSVDHPERIA SHSPDGSCSG 
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21729 
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33/08 TSDSA SSFNIPTSRQ NHYATGSFST 
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Scr EKSSNPDTON SSRPPRRVSH LLDSHYNTVT PQQPPSLTAA ATVSSQPNPP 
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3935 i 

11261 . ; 
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23196 GAFSDYASTT TTTSSDSHWS VDGLENRPSW LHTPMPSNFV FQSTSRSNSV 

Tfl .MKRDHHQFQ GKLSNHGTSS SSSSISKDKM MMVKKEEDGG 

Tf4 HKHDHHHHH QDKK TMMMNEEDDG 

18310 

18652 

4818 

21729 

1110 

174 

33/08 NSRTTNVATA TTNSATAHWV ATDAEHTDTI IAQP 
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FIG. 15F 



Scr LSVCGFSGLP VFPSDRGGRN VMMSVQPMDQ OSSSSSRSPT VWVDAIIRDL 

3989 

12398 

4871 ; 

11846 

2504 

3935 
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713 

10964 

23196 TGGGGGGNSA VYGSGFGDDL VSNMFKDDEL AMQFKKGVEK ASKFLPKSSQ 
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Tf4 NGM. DELIA V LGTKVHSSEH ADVAQKLEQL EVHHSNVQED DI^QLATETV 
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21729 D 

1110 i..... 
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Tf4 HYHPAELYTW LDSMLTDLNP P....SSHA ETOLK 
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186S2 

4818 

21729 LTSVNDMSLF GGSGSSQRYG LPVPRSQTQQ QQSDXGLFGG IRMGIGSGIN 
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|< bZIP like domain »| 

f< Motif II (dlnterisation) 

Sec PQPETVTATV PAVQTNTAEA LRERKEEIKR QKQDEEGLHL LTLIXQCAEA 
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4871 . AAIFYG HHHHTPPPAX RLNPGPVGIT 
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2S04 
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713 
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Tf 4 . . .H5WE TTTA TAESTRHWL VDSQENGVRi, VHALLACAEA 
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21729 ITKSMSDWIQ NLVTPNPHPN PVLSFSPSSS SSSSSPSTAS TTTSVCSRQT 
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Motif II (dimerixation) >| 

Scr VSADNLEEAN KLIXEISQLS TPYGTSAQRV AAXFSEAMSA RLLNSCLGIY 

3989 

12398 

4871 EQLVKAAKVI ESDTCIAQGH. ARLNQQLSS PVGKPLERAA FYFKEALNNL 

11846 

2S04 

393S 

11261 

713 

10964 ; 

23196 VSVDDRRTAH EKLRQIREHS SPLGNGSEFX AHYFANSLEA RLAGTGTQIY 

Tf 1 IQQNNLTLAE ALVKQIGGLA VSQAGAMRKV ATYFAEALAR RIYRLSPPQM 

Tf4 VQKENLTVAE ALVKQIGPUV VSQIGAMRQV ATYFAEALAR RIYRI^PSQS 

18310 

18652 

4818 CT 

21729 VMEIATA1AE GKTEIATEIL ARVSQTPNLE RHSEEKLVDF KVAALRSRIA 

1110 . . .LSMVKEL RQIVSIQGDP SQRIAAXMVE GIAARMAASG KFIYRALKCK 
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|< Motif III (SCR VHIID) 

Scr AALPSRWMPQ THSLXMVSAF QVFNGISPLV KFSHFTANQA IQBAFEKEDS 

3989 LYRNKAU, DEIGGMATSC 
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4871 LHNVSQTLSA CSLIFKVAAY KSFSEISFVL QFAKFTSNQA LLESFHGFHR 

11846 
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3935 .-. . AMEGEKM 
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713 
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23196 TALS...SKK TSAADMLKAY QTYMSVCPFK KAAIIFAHES MMRFTANANT 

« 1 QIOHCLSDT LQ MHFXETCPXX. KTAHFTANQA XLEAFEGKKR 

Tf 4 PIDHSIOTT LQ MHFXETCPYI, KFAHFTANQA IZtEAFQOKKR 

18310 EA SVKGTN...H 

186S2 AHVB ILEAIAOBTR 

4818 SPTCPELLT YM HILYEACPXT KFGYESANGA XAKAVKNESF 

21729 SPVTELYGKE HLISTQI.... . -LYELSPC? KLGFEAAKLA 3XDAADNNDGGMMI 

1110 EPFSDERLA. AH QVLFEVCPCF KFGFLAANGA 3XEAIKGEEE 
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— Motif III (VHIZD) >|< Motif IV 

Sec VHIIDLDIKQ CLQWPCLFHI LASRPCGPPH VRLTGLGTSM EA. LQ 

3989 IHVIDFDLGV GGQWASFLQE LAHRKGAGGM ALPIiKLTAF MSTASHHPLE LH 

12398 

4871 iailDFDIGY GGQWASLMQK X,VLRDHAAPLSLKITVTASPA NHVQLELG. . 

11846 

2S04 

3935 VHVTDLDASE PAQWLAIXQA FKSRPEGPPH LRITGVHHQK EVLE 
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713 

10964 

23196 IHIIDFGISY GFQWPALIHRLSLSRPGGSPK UUTGIELPQ RGFRPAE. . . 

Tfl VHVTDFSKNQ GLQHPALMQA IALREGGPPT FRLTGIGPPA PDHSDHLH.. 

Tf 4 VHVTDFSMSQ GLQWPALMOA IALRPGGPPV FKLTGIGPPA PDNFDXUI. . 

18310 VHIIDFSLMQ GLQWPALKDV FSAREGGPPK UUTQIGPHP IGGRDELH. . 

18652 VHIIDFQIAQ GSQYKFLIQE LAKRPGG PPJXKVT GVDDSQST3fARGGGI>S 

4818 VHIIDFQISQ GGQWVSLIBA LGARPGG PPNVRIT GIDDPRSSFARQGGLE 

1110 VHIIDFDINQ GNQrKTLIRS IAELPGK RPRLRLT GXD DPES VQRS XCGLR 

21729 PHVTDFDIGE CGQYVNLLRT tSTRRNGKSQ SQNSPWKIT AVANNVYGDCLVDDGGEERLK 
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MOtif I V »| 

Scr ATGKRLSDFT DKLGLPFEFC PLAEKVGNDL TERLNVRKRE AVAVHWL . . . 

3989 LHLTQOKLSQ FAAELRIPFE FNAVSLDAFN PAESISSSGD EWAVSL- . . 

12398 

4871 FTQDNTJCHFA SEINISLDIQ VL..SLDLLG SISWPNSS.. EKEAVAVNIS 

11846 

2504 NGGAF APSTWTA... 

3935 CMAHKL1EEA EKLDIPPQFN PWSRLOCLN VE...QLRVK TGEALAVSSV 

11261 K KWETITLDEI. MINPGETTW 
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10964 

23196 EFRRQVIAWL DTVSDTMFRL STTQLLRNGE TIQVEDLKLR QGEYWVNSL 

Tf 1 EVGCKLAQLA EAZEVEFEYR GEVANSLADL DASMLELRPS DTEAVAVNSV 

Tf 4 EVGCKLAHLA EAIHVEFEYR GFVANTLADL DASHLELRPS EIESVAVNSV 

18310 EVOIRLAKXA HSVGIDFTFQ GVCVDQLDBL CDWML.LKPI KGEAVAIKSI 

18 652 LVGERLATLA QSCGVPFEFH D. . .AXMSGC KVQREHLCLE PGFAWVNFP 

4818 LVGQRLGKLA EMCGVPFEFE G...AALTCT EVEIEKLGVR KGEALAVNFP 

21729 AVGDLLSQLG DHSISVSFNV V...TSLRLG DLNRZSLGCD PDETIAVNIA 

1110 I1GLRLEQLA EDNGVSFKFK A. • . MPSKTS IVSPSTLGCK PGETLXVNFA 

174 

451 500 
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Scr ...QHS 

3989 P VO i 

12398 

4871 AA 
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3935 LQLHTFIASD DDLHRKNCAL RFHNNPSGVD LQRVXMHSHG SAAEARENDK 

11261 NCIHRLQrTP DE - - 

713 

10964 

23196 FRFRHH.... DE 

T£l FELHKLLGRX GO 

Tf 4 FEXHKLLCRP GA . 

18310 LQLHRU.VDP OA 

18652 YVLHHK...P DE 

4818 LVLHHM...P DE , 

21729 FKLCTV...P DE..., 
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Motif V >| |< Motif VI 

Scr LXDVTGSD AHTLWU^QRL, apkwtweq dlshags.fi. 

3989 CS ARAPPL PAILRLVKQL CPKVWAIDH GCDRADLPFS 

12398 

4871 SFSHLPLV LRFVXHI^SPT ZZVCSORGGE RTDLPFSQQI. 

11846 Q EADHNKTCFL 

2S04 .NGGAFAPST WTARSLPVPSSPST DSF 

393S SKNNGYSPSG DSASSLPLP SSGRT DSFLHAIHGI. SPKVMWTEQ DSDHNGSXLK 

11261 TVSLDSPR DTVLJXFRDI NPDLFVFAEI NGKYNSPFFM 

713 NGSYNAPFFV 

10964 AXNAPFFV 

23196 TVLVNSPR DAVLKLIRKI NFNVFIPAXL SGNYNAPFFV 

Tfl .... , I EKVLGWKQD TGDFKXWXRQ EPNHNGPGFL 

Tf4 1" DKVLGWNQI KPEIFTWEQ ESKHNSPIFL 

18310 HPWFAPI DUXICLVUCI NPKIFTWEH EADHMRPPLL 

18652 SVSVEKXR DRIXHLIKSL SPNLVTLVEQ EDNTNTSPLV 

4818 SVTVENHR DRLLRLVKHL SPHWTLVEQ EANTKTAPFZ. 

21729 SVCTENPR DEIXRRVXGI. KPRWTLVEQ EHNSNTAPFL 

1110 SVTTVNQR DBLLHMVKSI. KPKLVTWEQ DVHTNTSPFF 
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Motif VI 

GRFVEAIHXY SKLFVSLG&S Y..GEESEER HWEQQLLSK EIRNVXAVGG 
QHFLNCFQSC VFLDSLDAAG I. .DADS A. . CKIERST.IO.P RVEDAVTO . . 

SLEPN L. .DRDS7XR UWERVLFGR RIMDLVRSDD 

AHSLHSHTAL FESLDAVNAH T,. .DAM QKIERFT.IQP EIEKLVLD.. 

DRFTEALBTY SAVFDSLDAA K. .NNNNNNN QRMEAEYXQR EICDrVCGEO 
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SAIFDKLETN 
SS IFDMLETI 
SAVFDMCOSK 



V. .PRTSQDR 
IHAEDEYKNR 
I..FKDNEQR 
V. .PREDEER 



AIHFDSLEAK 
TAMFESIDAA 
IAVFESIDVK 
CALLESVKST 
SAVFESLDMT 



R. . PRDDKQR 
It. .ARDHKER 
V..PSTNSDR 



IKVEKMLFGE 
SIXERELLVR 
IX1ESALFSR 
MIXEHEVFGR 
LMXVFEPYGR 
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Motif VI ►! 

Scr . SDGXTLVD . DNGTLBXGWK DLSLLTASAW TPRSX 

3969 VRGFH..VEK RGAALTLTHQ RGELVSISSW RCX 

12398 YSTLYSLVES EPGFISIJVWN NVPLLTVSSW RX 

4871 "VRGFH..VEE KHNSLLLCWQ RXELVGVSAW RCRSSX 

11846 " 

2S04 

393S FDGYR..1KE ESGCAVTCWQ ORPI.YSVSAW RCRKX 

11261 RXHRDFVIDS DNNWMLQGWK GRVIYAFSCH XPAEKFTNNN LNIX 

713 LPTRTFIIDE DNRWIXQGHK GRILFALSTW KPONRSSSX 

10964 FTHKDFVIDQ DNRWLLQGWK GRTVMALSVW KPESX 

23196 CYDKNFDVDQ NGNWLLQGWK GRXVYASSLW VPSSSX 

Tfl NSGQGXRVEE SNGCLMLGHH TRPI.ITTSAW KLSTAAHX 

Tf 4 NGGEGYRVEE SDGCLKLGWH TRPLIATSAW KLSTNX. 

18310 .SGSOFNILV CDGSIAIAWH NRM.TVATAW CVTGGKAASS MVGN1CKGTN 

186S2 ATOKNYKLGG HEGALYLFWK RRFMATCSVW KPNPNYIGX 

4818 SISEKTELEE RDGALXXGMK NQPLXTSCAW RX 

21729 VHPG.FTVKE DNGGVCFGHM GRALTVASAW RX 

1110 CjYCNKXKLKE EMGELHFCWE EKSLIVASAW RX 

174 DXCSKXICVKE EKGELHFSHE EKSLtVASAW SX 

701 750 
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FIG . 16A 



SRPal ( 11101 

CTTTGTCAATGGTAAATGAGCTGAGGCAGATAGTTTCTATCCAAGGAGACCCTTCTCAGA 

GAATCGCAGCraACATGGTGGAAGGTCTAGCTGCAAGAATGGCCGCrTCAGGAAAATTCA 

TCT ACAG AG CATTG AAATG CAAAG AG C CT CCTTCGG ATG AG AGG CTTG CAG CT ATG CAAG 

TCCTGTTTGAAGTCTGCCCTTGTTTCAAGTTCX3GGTTTTTAGCAGCTAATGGTGCGATAC 

TTGAAGCAATCAAAGGTGAAGAAGAAGTTCACATAATCGATTTCGATATAAACCAAGGGA 

ACCAATACATGACACTGATACGAAG(^TTGCTGAGTTGCCIX3GTAAACGACCTCGCCTGA 

GGTTAACAGGAATTG ATGACCCTGAATCAGTCCAACG CTCCATTGGAGGGCTAAG AATCA 

TCGGTCTAAGACTCGAGCAACTCGCAGAGGATAATGGAGTATCCTTCAAATTCAAAGCAA 

TGCCTTCAAAGACTTCGATTGTCTCTCCATCAACACTCGGTTGCAAACCAGGAGAAACCT 

TAATAGTGAACTTTGCATTCCAACTTC^CCACATGCCTGACGAGAGTGTCACAACAGTAA 

ACCAGCGGGACGAGCTACTTCACATGGTCAAAAGCTTAAACCCAAAGCrTGTCACGGTCG 

TTGAACAAGACGTGAACACAAACACTTCACCGTTCTTTCCCAGATTCATAGAGGCTTACG 

AATACT ACT CAG CAGTTTT CG AGTCTCTAG ACATG ACA CTTCCAAGAGAAAG CCAAG AG A 

GGATGAATGTAGAAAGACAGTGTCTCGCTAGAGACATAGTCAACATTGTTGCTTGCGAAG 

GAGAAGAACGGATAGAGAGATACGAGGCTGCGGGAAAATGGAGAGCAAGGATGATGATGG 

CIXMATTCAATCCAAAACCAATGAGTGCTAAAGTAACCAACAATATACAAAACCTGATAA 

AGOVAC^TATTGCAATAAGTACAAGCTTAAAGAAGAAATGGGTGAGCTCCATTTTTC 

GGGAGGAGAAAAGCTTAATCGTTGCTTCAGCTTGGAGGTAAGATAAGTGACAAGAGCATA 

TAGTCTTTATGTTTCATAAAACATAATTATGTTTTTACTGTAATCTTGGGTTATTGTGTA 

ACTGGTTAAATCATCTCCATGTATTATTACCAGAGGTTAGGGGTGATCACAGGTACTAAA 

AGCTAATCTAACACTTATGGAAGAATTTTTCTTTCTTTTTTTTCCCT 

AATTAGAGTTTTGGTTCTAAACCTATTTGCTAAGTGTGAATGAGTCTTTACATGTTCATA 

TTTCAGTTCAAATGGTTAAATTTGTTAAGGTTCTCACTTAAAAAAAAA 
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FIG. 16B 



SRPa3 (393S) 

GCTATGGAAGGAGAGAAGATGGTTCATCTGATTCATCTCGATGC^ 

TGGCTTGCTTTGCTTCAAGCTTTTAACTC^ 

ACTGGTGTTGATCACCAGAAGGAAGTGCTTGAACAAATGGCT 

GCAGAGAAACTCGATATCCCGTTTCAGTTTAATCCCXST^ 

AATGTAGAACAGTTGCGGGTTAAAACAGGAGAGG<Xr!rTAGC(X3T^^ 

TTGCATACCTTCTTGGCCTCTGATGAO^ 

CAGAACAACX2CTAGTGGAGTTGACTTGCAGAGAGTTCTAATGATGAG<X^ 

GCTGAGG<^CGTGAGAATGATATGAGTAACAACAATGGGTATAGCCCTAGCGGTGACTCG 

GCCTCATCTTTGCCTTTACCAAGTTCAGGAAGGACTG&TAGCT^ 

GGTTTGTCTCCAAAGGTCATGGTGGTCACTGAGCAAGACTCAG 

CTAATGGAGAGGCTATTAGAATCACTTTACACCTACGC^^ 

ACAAAAGTTCCAAGAAOTTCTCAAGATAGGATCAAAGTGGAGAAGATGCTCTTCGGGGAG 

GAGATCAAGAACATCATATCCTGCGAGGGATTTGAGAGAAGAGAAAGACACGAGAAGCTT 

GAGAAATGGAGCCAGAGGATCGATTTGGCTGGTTTTGGGAATGTTCCTCTTAGCT 

G CG ATGTTGCAGGCTAGG AG ATTGCTT CAAGGGTGCGGTTTTGATGGGTATAGAATCAAG 

G AAG AG AG CGGGTG CGCAGTAATTTGCTGG CAAGATCG ACCT CTATACTCGGTATCAGCT 

TGGAGATGCAGGAAGTGAATGATATATTACAGTTTGTCTTCTATTTTGGTTATC 

GTCCCTTTCTTTTTTGTATACATGGGGACAC&ATCTTAG^ 

CTGTCTCTTT ATG CTATTTTGGCTTAAATG CTTCTACTG CCTCTG CATGT AAAGCCTTTG 
TGTGTTGGTT CAATTTGGTCTGGTGTGGGTGT AAT ACCAAAC CAAATCCAATTTG AG CTG 
AAGATAACTAATTTGATGATCGGCTCGTGCC 
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FIG . 16C 



SRPa4 (4818> 

ggcacgagcccaacgggtcctgagcttcttacttatatgcatatct^ 

ccttatttcaaattcggttatgaatctgctaatggagctatagctgaagctgtc 

gaaagttttctgcacattatckjatttccagat^^ 

atccgtgctcttggtgctagacctggtggacctccgaaogttaggataa^kmaattgat 

gatccgagatc^tcgtttgcrcgtcaaggaggacttgagttagttc 

aagctagctgaaatgtgcggtcttccgtttgagttc^ 

gaagtcgaaatcgagaagctaggagttagaaatggagaagcgctcgcggttaacttc6cg 

cttgttcttcaccacatg<x?roatgagagtgtaactgtggagaatcacagagatagattc 

ttgagattggtcaaa<^cttgtcaccaaacgttgtgactctggttgagc^ 

acaaacactgcgccgtttcttccccggtttgtcgagacaatc^ 

ttcgaatcaatagatgtgaaactcgctagagatcacaaggaaaggatcaatgttgagcag 

cattgtttggctagagaggttgtgaatcttatagcttgtgaaggtgttgaaagagaagag 

aggcacgagccactagggaaatggaggtctcggtttc^^ 

cctttgagctcgtatgtgaacgcaacaatcaaaggattgcrtgagagttattcagagaag 

tatacacttgaagaaagagatggagcattgtatttaggatggaagaatcaacctcttatc 

actt cttgtg citgg aggtaactaataaaaaccrttgracx^tttcag aag ag attag aaa 

cttcttttaaagtttgcagaatctgtttgtaaaagtaaaactcatgcatgatccgnagga 

acaagttgtcaaatgttgtagtagtaagtgatatgttga^acccaaaaaaaaaaaaaaa 

AAAAA 
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FIG. 16D 



SRPaS (48711 

GCMGCTATCTTCTACG<WCACCACCACCATACACCTCCGCCGGCA&AGCGGCTCAACCCr 

GGTCCCGTGGGGATAACAGAGCAGCTGGTTAAG<K»GCAGAGGTCATAGAGAGCGACACG 

TGTCTAGCTCAGGGGATATTGGCGCGGCTCAATCAA<^GClX?rCTTCTC^ 

CCATTAGAAAGAGCAGCITTTTACTTGAAAGAAGC^ 

TCCCAAACCCTAAACCCTTATTCCCTCATCTTCAAGATCGCTGCTTAC^ 

GAGATCTCT<X:C«TTCTTCAGTTCGCCAACTTTACCTCC^ 

TTCCATGGCTTCX^CCGTCT<XACATCATCGACTTCGATATCGGCT 

GK^CCCTCATGCAAGAGCTTGOTCTCCGCGACAACGCCGCT 

ACCGTTTTCGCFrCTCCGGCGAACCACGACCAGCTCGAACTTC 

CTCAAGCACTTCGCCTCTGAGATCAACATC^CCCTTGACATCCAAGTTTTGAGCTTAGA 

CTCCTCGGCTCCATCTCGTGGCCTAACTCXrrCGGAGAAAGAAGCTGT<^CCGTTAAC^TC 

TCCGCCGOSTCCTTCTCGCACCTCCCTTTGGTCC^^ 

ACGATCATCGTCTGCTCCGACAGAGGATGCGAGAGGACGGATCTGCCCTTCTCTCAA 
CTCGCCCACTCGCreCACTCACACACCGCTCTCTTCGAATCCCT 

AACCTCGACGCAATGCAGAAGATCGAGAGGTTTCTTATACAGCCGGAGATAGAGAAGCTG 

GTGTTGGATCGTAGCCGTCCGATAGAAAGGCCGATGATGACGTGGCAAGCGATGTTTCTA 

CAGATGGGTTTCTCACCGGTGACGCACAGTAACTTCACGGAGTCTCAAGCCGAGTGTOTA 

GTCCAACGGACGCCAGlXSAGAGGCTTTCACGTCGAGAAGAAAGATAACTCACrTCTCCTA 

TGTTGGCAAAGGACAGAACTCGTCGGAGTTTCAGCATGGAGATGTCXrcTCCTCCTGATTT 

CCACCGGAGTTTCAATTATTAAAAAAATATTTTCCTTAATTC 

AATTTTTAGTTTCTGATTTTATTTTGCTCAGTGC^ 

ACAAATATATAAATTTTTG 
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FIG. 16E 



BRPa6 (12398) 



AATCGCTTGAACCGAATTTGGATCGAGATTCGAAAGAAAGGCTGAGAGTGGAGAGAGTGC 

TGTTOaGTAGGAGGATTATGGATTTGGTCCGATCAGATGATGATAATAATAAACCGGGAA 

CCCGGTTTGGGTTAATGGAGGAGAAAGAACAATGGAGAGTGTTGATGGAGAAAGCTGGAT 

TTGAGCCGGTTAAACCGACTAATTACGCGGTTAGGCAAGCGAAGCTGCTACTATGGAACT 

ACAATXATAGTACATTGTATTCACTTGTTGAATCGGAGCCAGGTTTCATCTCC^ 

GGAAC^TGTGCCTCTCCTCACCGTTTCCTCTTGG 

AATCTAGTATTTTGAGTTAGCTTTTAGAAXTGW 

AATTAGTCTCTAGCCTATTCTCTTACTCTTTTTTGTCTAGTGCTTGGAGTGATGATGGOT 
TGTCGTTTATGTTCATTTGTAATATATATTGTATGTAACATTTGACTAAAAAAAAAAAAA 
AAAAAAAA 
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FIG. 16F 



SRPa7 (21729 /363S/17410) 

AAAGACTTTAGCAGATTTTCAAGCGGCTCAGAACATCAACAACAACAACAACAACAACCG 
TTTTATAGTCAAGCAGCTCTCAACXSCTT^ 
CAGAATTTTCAATCTCCGTCGGCCGATGATOH^ 
GTTTGGTGGTTCTCGTTCATCTCAGCGTrACXSGTT^ 

ACAGC^VCAATOGGATTACGGTTTATTTGGTGGGATCCGAATGGGAATCGGGTCGGGTAT 

TAATAATTATCCAACATTAACCGGCGTTCCGTGTATTGAACOGGTTCAAAACCGGGTTCA 

TGAATCGGAGAACATGTTGAATAGTTTAAGAGAGCTTCAGAAACAGCTTTTAGATGATGA 

CGATGAGAGTGGTGGTGATGATGACGTGTCAGTTATAACAAATTCAAATTO 

TCAAAATCTCGTGACTCCGAACCCGAACCCGAACCCTGTTTTGTCTTTTTCACCGAGCTC 

TTCTTCTTCGTCTTCTTCGCCTTCTACAGCTTCGA 

AACGGTTATGGAAATCGCGACGGCGATCGCGGAAGGGAAAACAGAGATAGCGACGGAGAT 
TTTGGCGCGTGTTTCTCAAACGCCTAATCTTGAGAGGAATTCAGAGGAGAAGCTrGTTGA 
TTTCATGGTGGCTGOSCTTOSATCGAGGATAGCTT^ 
GGAGCATTTAATCTC<IACTCAATTGCTCTACGAGCTCTCT 

CGAGGCCGCGAATCTCGCCATTCTCGACGCCGCCGA7!AACAACGACGGTGGAATGATGAT 
ACCGCACGTTATCGATTTCGATATCGGAGAAGGTGGACAATACGTrAACCTTCTCCGTAC 
ATTATCCACGCGCCGGAATGGTAAAAGTCAGAGTCAGAATTCKXX5GTGGTTAAGATCAC 
CGCCGTGGCGAACAAOGTTTACGGATGTTTAGTCGATGACX3GTGGAGAAGAGAGGTTAAA 
AGCCGTCGGAGATTTGT3X5AGCCAACTCGGTGATCGACTCGGTATCTCCGTAAGTXTCIAA 
CGTGGTGACX^GTTTACGACTCGGTGATC^rcJAATCGTGAATCTCTCGGGTGTGATCCCGA 
CGAGACTTTGGCTGTGAACTTAGCTTTCAAGCTTTATCGTC 

CACGGAGAATCGAAGAGACGAACTTCTCCXaGCGCGTGAAGGGACTTAAACCGCGCGTGGT 
TACTCTAGTGGAGCAAGAAATGAATTCGAATACGGCGCCGTTTTTAGGGAGAGTGAGTGA 
GTCATG CGCGTGTT ACGGTG CGTTG CTTG AGTCGGT CG AGT CTACGGTTC CT AGT ACG AA 
TTCCGACCGTGCGAAAGTTGAGGAAGGAATTGGCCGGAAGCTAGTAAACGCGGTGGCGTG 
CGAAGGAATCGATCGTATAGAGCGGTGCGAGGTGTTCGGGAAATGG<^AATGCGGATGAG 
CATGGC1XKX3TTTGAGTTAATGCCATTGAGTGAGAAGA^ 

TGGAAACCGAGTCCACCCGGGCTTTACCGTTAAAGAAGATAACGGAGGTGTGTGCTTTGG 

TTGGATGGGACGGGCACTCACTX3TCGCAT<X^CTTGGCGTTA^ 

TTTCTTCTTATTATTACCATATTATTATTAATTTTCGA^ 

TTGTGATTTTCCGTTTCGAAAAGTGTAGX3AATCTTATGTAACAAAGAAAAAAAAAAGACT 
TTTATGTTTTTCTAATAATAAAAGAAAGAGTGATTGGGTTCAAAAAAAAAAAAAAAAAAA 
AAAAAAAA 
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FIG. 16G 



SRPaB (109641 

TGCATACAACGCACCGTTTTTCGTAACACGGTTT CGCGAAG CTCTATTTCATTTCTCCTC 
GATTTTTG ACATG CTTGAGACAATTGTGCCACGAGAAGACG AAGAGAGG ATGTT CCTTG A 
GATGGAGGTCTTTGGGAGAGAGGCACTGAATGTGATTGCTTGCGAAGGTTGGGAAAGAGT 
GGAGAGK3CCTGAGACATACAAGC2VGTGGCACGTACGGGCTATGAGGTCAGGGTTGGTGCA 
GGTTCCATTTGACCCAAGC\TTATGAAGACAT<^^ 

CAAGGATTTTGTGATCGATCAAGATAACCGGTGKjOTCTTGCA&GGCTGGAAGGGAAGAAC 

TGTCATGGCTC^rrTCTGTTTGGAAACCAGAGTCCAAGGCTTG^ 

GCATATGAGAGACCATCTCTTGATTTT<^PTCCTGTGTAATTCCCAGAG^ 

ATGTAAGAAGAGAATGCTGC^CAAAGAACTTGTTCAAAGATAATATTGATGTAAGTCCTG 

TTTTATAACTTTCTAGCTGTGTTTTTGTTGTTTCTCAGCTAGATTCTCCTAACGGTATTC 

TTGT AG CT AGGGTG ATCAG ATTGTTTGT AT ATTG CT AG CAG AGTT AGTTTGT CT AG ATTG 

TAACACATATAAGAGGAAGCTTAGAGTTTCTATGGTTTAAAGAGAAGTTTTTTCCTTCTC 

CAATGTAAAAAAAAAAAAAAAAAAA 
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FIG. 16H 



SRPalO (11261) 



AAAAAATGGGAAACCATCACTCTTGATGAACTTATGATCAATCCAGGAGAGACAACGGTC 
GTCAACTGCATTCATCGGTTACAATACACTCCTGATGAAACTGTGTCATTAGACTCTCCA 
AGAGACACGGTTCTGAAGCTATTCAGAGATATCAATCCTGACCTCTTTGTGTTTGCAGAG 
ATTAACGGAATGTACAACTCTCCTTTCTTC^TGACGAGGTTCC^ 

TACTCTTCACTCTTTGACATGTTTGACACCACAATACACGCAGAGGATGAGTACAAAAAC 
AGGTCACTGTTGGAGAGAGAGTTACTTGTGAGAGACGCGATGAGCGTGATTTCCTGCGAG 
GGTGCAGAGCGGTTTGCGAGGCCTGAAACCTACAAGCAATGGCGAGTTAGGATTTTGAGA 
GCCGGGTTTAAGCCAGCAACTATTAGCAAACAGATCATGAAGGAGGCTAAGGAAATTGTG 
AGGAAACGTTACCATAGAGATTTTGTGATCGATAGCGATAACAATTGGATGCTTCAAGGA 
TGGAAAGGAAGAGTCATCTATGCTTTTTCTTGCTGGAAACCTGCTGAGAAGTTCACAAAC 
AATAATTTAAACATCTGAAAAATGTTACTTCT^ 

TTTTGTAGAATATGTTTGATCCCGTGAGTGGATGCAACTCTTTTTTCCTGCAAGTACATA 
TTGTATTCAAATCCTTGTGGAAATGATAAATTGTTTAATCAAAAAAAAAAAAAAA 
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SRPall (18652) 

GCGAATGTTGAGATCTTGGAAGC^TAGCTGGGGAAA.CCAGAGTCCACATTATCGATTTT 
CAGATTGCACAGGGATGAGAATACATGTTTTTGATTGA^ 

GGGCCGCCGTTGCTGCGTGTGACGGG'TGTGGATGATTCACAGTCCACCTATGCTCGTGGG 

GGAGGACTCAGCTTGGTAGGTGAGAGGCTTGCAACTTTGGCGCAGT^ 

TTTGAGTTTCACGATGCCATCATGTCTGGGTGCAAGJGTGCAGCGGGAACATCTCGGGTTG 

GAACCTGGCTTTGCTOTTGTTGTGAACTTCCCATATGTATTACACCACATGCCAGAC^AG 

AGCGTAAGTGTTGAAAAATACAGAGACAGGCTGCTGCATCTGATCAAGAGCCTCTCCCCA 

AAACTGGTTACTCTAGTAGAGCAAGAATCCAAC^CAAACACCTCGCCATTGGTGTCACGG 

TTTGTGGAAACACTGGATTACTACACAGCGATGTTTGAGTCGATAGATGCAGCACGGCCA 

CGGGATGATAAGCAGAGAATCAGCGCAGAACAACACTGTGTAGCAAGAGACATAGTGAAC 

ATGATAGCATGTGAGGAGTCAGAGAGAGTAGAGAGACACGAGGTACTGGGGAAATGGAGG 

GTCAGAATGATX3ATGGCTGGGTTCACGGCTTGGCCGGTCAGCACATCTGCAGCGTTTGCA 

GCGAGTGAGATGCTGAAAGCTTATGACAAAAACTACAAACTGGGAGGCCATGAAGGAGCG 

CTCTACCTCTTCTGGAAGAGACGACCCATGGCTACATGTTCCGTGTGGAAGCCAAACCCA 

AACTATATTGGGTAAGTTATAGTGATGATGGTTACTTGAGTGGATAAAGAAGAGCACAAC 

AAAAACACATCTGTCGCTGTAAATTTTTTAGGATGTGCAATGATGTTTTAAGTTGTAACA 

CAACCTAAGTTATATATGTATACAAACCAAACCTCGTGGTTGTTTTTCTCTTGTAAATTG 

TCATGTGGTTGTGGGTGGGAAGCTAGTAATGAAATATAACCAAAACATTGATTAGGTCAA 

AAAAAAAAAAAAA 
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SRPal2 (23196) * 



TCTTACTCAAGGTTCTTCTTTGTCATCTTGTT GCCG AATCCACAAAGAGGAGAATAAAGA 

TTCGACCTTTATTAGATATTAACGACTCTGGATTTTTGGGTTTTTGGAGTTGGATCC^ 

TGGGTTCTTATCCGGATGGATTCCCTGGATCCATGGACGAGTTGGATTTCAATAAGGACT 

TTGATTTGCCTCCCTCCTCAAACCAAACCTTAGGTTTAGCTAATGGGTTCTATTTAGATG 

ACTTAGATTTCTCATCCTTGGATCCTCCAGAGGCATATCCCTCCCAGAACAACAACAACA 

ACAACATCAACAACAAAGCTGTAGCAGSGAGATCTGTTATCATCTTCATCTGATGACGCTG 

ATTTCTCTGATTCTGTTTTGAAGTATATAAGCCAAGTTCTTATGGAAGAGGATATGGAAG 

AGAAGCCTTGTATGTTTCATGATGCTTTGGCTCTTC^ 

AGGCTCTTGGTGAGAAAGACCCTTCTTCGTCTTCTGCTTCTTCTGTGGATCATCCTGAGA 
GATTGGCTAGTCATAGCCCTGACGGTTCTTGTTCAGGTGGTGCTTTTAGTGATTACGCTA 
GCACCACTACCACTACTTCCTCTGATTCTCACTGGAGTGTTGATGGTTTGGAGAATAGAC 
CTTCTTGGTTACATACACCTATGCCGAGTAATTTTGTTC 

ACAGTGTCACCGGTGGTGGTGGTGGTGGTAATAGTGCGGTTTACGGTTCAGGTTTTGGCG 
ATGATTTGGTTTCGAATATGTTTAAAGATGATGAATTGGCTATGCAGTTCAAGAAAGGGG 
TTGAGGAAGCTAGTAAGTTCCTTCCTAAGTCTTCrrCAGCT 

ACATCCCTATGAATTCTGGTTCCAAGGAAAATGGTTCTGAGGTTTTTGTTAAGACGGAGA 
AGAAAGATGAGACAGAGCATCATCATCATCATAGCTATGCACCACCACCCAACAGATTAA 
CTGGTAAGAAAAGCCATTGGCGCGACGAAGATGAAGATTTCGTTGAAGAAAGAAGTAACA 
AGCAATCAGCTGTTTATGTTGAGGAAAGCGAGCTTTCTGAAATGTTTGATAACATGTTCC 
TATCTGGCCCTGGGAAACCTGrrATGCATTCTTAACCAGAACTTTCCTACAGAATCCGCTA 
AAGTCGTGACCGCACAGTCAAATGGAGCAAAGATTCGTGGGAAGAAATCAACTTCTACTA 
GTCATAGTAACGATTCTAAGAAAGAAACTGCTGATTTGAGGACTCTTTTGGTGTTATGTG 
CACAAGCTGTATCAGTGGATGATCGTAGAACCGCCAACGTTTAGCTAAGGCAGATACGAG 
AGCATTCTTCGCCTCTAGGCAATGGTTCAGAGCGGTTGGCTCATTATTTTGCAAATAGTC 
TTGAAGCACGCTTAGCTGGGACCGGTACACAGATCTACACCGCTTTATCTTCGAAGAAAA 
CGTCTGCAGCAGACATGTTGAAGGCTTACCAGACATACATGTCGGTCTGCCCTTTCAAGA 
AAGCTGCTATCATATTTGCTAACCACAGCATCATGCGTTTCACTGCAAACGCCAACACGA 
TCCACATAATAGATTTCGGAATATCTTACGGTTTTCAGTGGCCTGCTCTGATTC^TCGCC 
TCTCGCTCAGCAGACCTGGTGGTTCGCCTAAGCTTCGAATTACCGGTNNNNNKNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNGAGTTCAGGAGACA 

GTCAGCGACACAATGTTCCGTTTGAGTACAACGCAATTGCTCAGAAATGGGGAAACGATC 

CAAGTCGAAGACITAAAGCTTCGACAAGGAGAGTATGTGGTTGTGAACTCTTTG 

TTCAGGAACCTTCTAGATGAGACCGTTCTGGTAAACAGCCCGAGAGATGCAGTTTTGAAG 

CTGATAAGAAAAATAAACCCGAATGTCTTCATTCCAGCGATCTTAAGCGGGAATTACAAC 

GCGCCATTCTTTGTCACGAGGTTCAGAGAAGCGTTGTTTCATTACTCGGCTGTGTTTGAT 

ATGTGTGACTCGAAGCTAGCTAGGGAAGACGAGATGAGGCTGATGTATGTGTTTGAGTTT 

TATGGGAGAGAGATTGTGAATGTTGTGGCTTCTGAAGGAACAGAGAGAGTGGAGAGCCGA 

GAGACATATAAGCAGTGGCAGGCGAGACTGATCCGAGCCGGATTTAGACAGCTTCCGCTT 

GAGAAGGAACTGATGCAGAATCTGAAGTTGAAAATCGAAAACGGGTACGATAAAAACTTC 

GATGTTGATCAAAACGGTAACTGGTTACTTCAAGGGTGGAAAGGTAGAATCGTGTATGCT 

TCATCTCTATGGGTTCCTTCGTCTTCATAGATGTTGTTTCTTACGTTCTAAGCGACTGGG 

ATTTATGTAGGGCTTTTCTGTTGATAGTCTCTCGCCAACACGAGTGGATTAAGTTCAGAG 

TTAGGGTTCTTGAACACrrAGAATGTTGTTATATTATGCTTGTGACATAGCGTGTGTAAGA 

GTGTAGCCTAAGAGATATAGTACTCATTGCATGATCTTTTGCTATATGTTNCATGT 
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FIG. 16K 



SRPdl 

TCTG CAG ACAATTTTN AGG AGG CCAATACCATG CT ATTGG AAATTTCAG AACTG 
TCCACACCTNNNNNNNNNNNNNNNNNNNNNNN^ 
AATGTCGGNNAGATTAGTTAGCTCCTGCTTAGGAATCTATGCTTCT 
AACAGTGGTGCCTCCTCATGGTCAGAAAGTGGCCTCA 
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FIG. 16L 



SRPql 

TCAACTGAGAATCTAGAAGATGCCAA(^GATGCra^ 

TCAACACCGTTCNN<^CTTCAGCACAGCGTGTGGCAGCATATTTCTCAGAAGCC 

ATATCAGCAAGGTTGGTGAGTTCATGTCTAGGGATATACGCAACn^TTGCCACAC 

ACACACCAAAGCCACAAGGTAGCTTCAGCTTTTCAAGTGTTCAATGGTATTAGT 

CCTTTAGTGGAGTTCTCAC&CTTCACAGCAAACCAAGQU^TTCA^ 

GAAAGAGAAGAGAGGGTCCACATCATAGATCTTGATATAATGCAAGGGTTG 
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FIG. 16M 



SRPpl 

TCTGCAGACAACTTTG2VAGAAGCCAATAC2U^TACTGCCTCA.GATCACAGAA.CTC 
TCCACCCCCTATNGCAACTCGGTGCAACGAGTGGCTCCCTATNNNNNNNNNNNN 
NNNNNKNNNNNNNNNNKNNHNNNNTGCATAGGAATGTATTCT 
ATTCACATGTCCCAGAGCCAGAAAATTGTGAAT 
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FIG. 17A 



Partial DNA sequence of ZCARECRCW gene 

GATATCAGCATCATCAATTTTAAATGTAAGTTCGCA 
AATTTGGCCACAAGGTATGACACTGTCTCAATTGAGCAAT^ 

TCATATATTG<7IX1ATATT6AAAGTGAAAAAGATATGC7CAAGAACCTAGTAGAGAAGCTA 

AAAATTGAAAAATCTAGCTCTACTAGAAAAAIATGATAGGT^ 

TTTATTAGATAATCATATGATGGCTAGATGTCGCTC&XG&GGTTC 

ATTCCTGTGGGCATTCATCTCTTTTAGATGC&CTAACATGATAGGAAGT^ 

TGCTTCACAATTCTGGTGATTCATGCTTCCTTCATT^^ 

ATGCTTCAGTCACTTTGTGCGTTTAATTGGTATTGTATGTATCACTAGATTGTAGGGTGT 

eTGCAACTAGTGTTTCACCATGTGGTTTTTT^^ 

TATTGATATATTAAAGTGATAACTAGTTTTAGAAATATTCT^ 

AACTTGTTTTTAGCGTGTACGTTAGCATTATAATATTTCCTTATTATGAAAGCGGAAGAG 

AAACGCGCCGAACCAGAGCATCCACGTCGTCTCATTTCACCT^^ 

TGAGCGGTCC&<X:GTGAACTCCGTTTGCCTCCA^ 

TAGCTTCTAGAAACATCACXrATGTGTCCCGTCCATTCCTTTAGGAGGAGCCGGATCCGG 
GCCGCAGTCGCCCAAGGTCCCGAaX«K>3CX3G«rra3GC 

ggaggtgcagcggoggaagcagcgogacgaggagggcctccacctgctgagtgctgacgc 

tgctgctgcagtgcgcggaggccgtgaacgcggacaacctcgaogacgcgcaccagaogc 

tgctggagatcgcggagck^ccacgccgttcggcacctcgacccagcgcgtggccgcct 

acotcgcggaggccatgtcggcgcgcgtc«t<^^ 

tgccgccgggctccxzccgccgcggcgcgcctc^ 

tgttcaacggcatcagccckttcgtcaagttctcgcacttcaccgcgaac^ 

aggaggcgttotagcgggaggagcgtgtgcacatcatcgacc^^ 

tgc^gtggccgggcctcttccacatccttgtctcccgccccggcggcccgcccagggtca 

ggctcaccggcctgggggcgtccatggacgc^^ 

acttcgccgaca03ctcgg<xtgcccttcgagttctg 

ACGTTGACC03<^GAAGCTGX3XK»lK»aX^ 

ACCACTCGCTTTACGACGTCATOGGCTCCGACTCCAACACGCTCTGGCTCATCCAAAG^ 
CCKX^TTTTCK^PTCrcPGCOT 

TTCAGCTGCTGACATrGGATAATGTGAGCTTTACGGCAAGGATCAAGTCGTGGTAGTACA 
TCCATTACAGCTATTTCTAAAATATTCTTCGGAGGTTTCCTGCTCATAGTAAAAAAAAAT 
CGCGTTTTGAAGCTCAAAAGGCGATTTCTTCCGAGkSTT^ 
AACCCC^TTTTCTCAATTGATTTTTATTTT^ 

TGAAATGGAGTCCCAAACTAACCCTAATATTAAAAAAAACGCGCTTTGGAGCT 
CTCGTTGTTATGACCAACCAGCTTTATAGGTTTAAAAAGGTTC^ 

TGAAAAGGTTGAATCTTGACAATGCTTTTGAGATGATACTGTAGTGTAGTCTGTAGTGGA 
GCATCCTCCATGGTCTTTGGTGATCGAGAATTCCTGCAGCCCGGGGGATCC 
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FIG. 17B 



Partial amino acid sequence of ZCARECROW protein 

YQHHQFXMXVGKRSXGFSXXFGHKVXHCLNXAIXXRNXSIIYCSYXKXKRYAQEPSREAK 

NXKIXLYXKNHIGCLFIJlKIYXIIISVnJDVAHE^^^ 

LHNSGDSCFIJICNXYXCXIHASVTIX^ 

LIYXSDNXFXKYSLVPIJlLQLVFSVyVSIIIFPyYESGRETRPTRASTSSHFTFIVGSXM 
SGPRXTPFACKTTSSTRCXVASRiniTMCFVHSFRRSRIRRRSRPRSRPPRPRPPPPIlSGR 
RCSGGS£^TPJEtASTCX VLTIiLI^CAEAVKADNIJ3DAHOTI^ErAEIiATPFGTSTORVAAY 
FAEAMSARWSSCLGLYAPLPPGSPAAARmGRVAAAFOVFKGISPFVKFSHFTAWOATp 
FAFEREFJtVHIIDLDIMOGIiQWPGLFHILVSRPGGPPRVRLTGlXjASMDAIiEATGKRIjSD 
FADTI^riPFEFCAVAFJCRGNVDPOKLGVTRREAVAVHWPHHSLYDVTGSDSWTLWIiIOR S 
SIFI^I^SKSNI^IMTTFQLLTIIHWSFTA^ 

RFEAQKAISSEVCCXALFWKPHFI^FLFFKEKLVHFSLVKWSPKLTLILKKTRFGAQNA 
RCTDQPALXVXKGXILTMLLKRLHLDHAFEMILXCSLXWSILHGLWXSRIPAARGX 
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FIG. 18 



SCR 
SRPdl 

SRPgl 

SRPpl 



302 349 
SADNIiEEANKLLLEISQLSTPYGTSAQRVAAYFSEAMSAIlLLNSCLGI 
SADNFxEANTMLLEXSEIiSTPXXXXXXXXXXYFSXXMSxRLVSSxLxI 



S&DNFEEANTILPQrTELSTPYxNSVGRVAAYxxxxxxxxxxxxCIGM 



350 396 
SCR YAALPSRWMPQTH-SLKKVSAFQVFNGISPLVKFSHFTANQA.IQEAFE 
SRPdl YASLPATWP — PHGQKVAS 

SRPgl YATLP HTHQSHKVASAFQVFKGISPLVEFSHFTANQAIQEAFE 

SRPpl YSPLPPIxMSQ SQKIVN 



SCR 
SRPgl 



397 412 
KEDSVHIIDLDIMQGIi 
REERVHIIDLDIMQGL 



PCT7US00/05875 



FIG. 20 
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FIG. 



68/101 
21 



PCT/US00/05875 



SCR Promoter: :GUS SCR Promoter::SCR 
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Fig. 22 
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Embryo development in maize, (a) Three-celled embryo showing first division of 
terminal cell. (b)-(c) Embryos showing embryo proper and suspensor. (d)-(e) Embryos 
showing the initial development of shoot and root apical meristems. (f)-(g) Embryos 
showing the elaborate organization of shoot and root apical meristems. 



Fig. 24 



PCT/US00/05875 



ctgctagctcagcctactcactccactcaactcacccccaactccactccgctcccgagc 
ccggactgactgactgactgtggtggtggtggcgcatcagcagcccgcgcggcgccaaaa 
cacgcaaactgctccctccctcactcacccctatcccccgcgctgggtcgcccgatcgcc 
atgcgcgcggcggcttcctctCggcgtttctagatgggctcctcctcctccctcctcttc 
tcctcgtcctcctccgccgcatccaccgccccccactcctttccccactctcATGCCACC 



PPPPPPLTPYCRRCPPPHLP 
TCCGCCTCCTCCITCTTCCCaVAACCACTTCC^^ 



P P P P S 



Q EAAAAAMVR K R PA 



GCCX3CCGCGCCGCCAOGTCACGGGCX»CXrrCTCOT^^ 




C3GTGGACGGCATCATCCGCGACATCATCGGGAGCAGCGGCGGC<K;CGCGGTCTCCATCAC 
VDGIIRDIIGSSGGAAVSIT 

GCAGCTCATCCACAACGTCWXXSAGATX^TCCACCCCTGCAACCCXXjGCCTCGCGTCGCT 
QIjIHNVREIIHPCNPGLASI/ 

CCTGGAGCTCCGCCTCCGCTCCCTCCTCGCA^ 



GCAGC^CAG<»GGAGGAACCXX^TCCGGCGCCGCAGT« 



Q Q Q Q B 



P H P A P Q 



P K A P T A 



AGAGACCGCAGCX3GCGGCCGCCXXXX3<^CyAGCAGCAGCnX3CTGCGGCXX5CCAAGGAGCX3 



E T A A 



A A A Q A A 



A A A A K E R 



GAAGGAGGAGCAGCXSGCGGAAGCAGCGCGACGAGGAGGGCCTCCACCTGCTGACCKnKKJT 
KEEQRRK QRDEEGLHLLTLL 
GCTGCAGTGCGCCGAGGCCGTGAACGCGGACAACCTGGACGACCXXSC^ 



L Q C A E A V 



D D A H Q T L 



GGAGATCGCGGAGCTAGCGACGCCX5TTCGGCACCTCX3ACX3CAGCGCGTGGCCGCCT 



EXAELATPF 



Q R V A A Y F 



A E A M 



ARLVSSCLGLYAPLP 



P G S P A A A 



LHGRVAAAFQVF 



C^CGGCMCAGCCCCTTCGTCAAGTTCTCGCAC^ 



PFVKFSHFT 



Q A I Q E 



GGCGTTCGAGCGGGAGGAGOGCGTG<^<^TCATCGACCTCGACATCATGCAGGGGCTGCA 
AFEREERVHIIDLDIMQGLQ 
GTCGCCGGGGCTCTTCCACJ 



WPGLFHILASRPGG 



P R V R 



CACCGGCCTCGGGGCGTCCATCGAGGOGCTCGAGGCCACGGGGA^ 



ASMEALEA 



120 
180 
240 
300 

360 

420 



PNHFLLHYLHQDDH 



CCAAGAAGCCGCCGCCGCCGCCATGOTCCGCAAGCGCCCCGCGTC^ 



D M "D L P 



G K R L S D F 



E K A G N V 



540 
600 
660 
720 . 
780 
840 
900 
9G0 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Fig. 25A 
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TGACCCCX3AGAAGCTAGGGGTCACGAGGCGGGAGGCCX3TCX^CGTCCACTGGCTGCACCA 1800 

484 DPEKLGVTRREAVAVHWLKH 

CTCGCTCTACGACGTCACTGGCTCO^CTCCAACACGC^^ 18 SO 

504 SLYDVTGSDSNTLWLIQR 

aggagtacaccatctctcgatcctgacttccttgctaccatgtcaaatcttgatgcaatc 1920 

atggccacttttcagctactaacactttagtttagccaatgcgacatccagtacaactaa 1980 

tctaaaaaaataatcttcagaggtttcctagtaaaaaaaccgcgtttttggagctcaaaa 2040 

agcttgtcattatgaccaaccaactttctaggcttaaaaaggttgaatcttggcaatgct 2100 

tttgagacgatgctgtactgaagtactggtagagagagtatcctccatggcctttgt'tga 2160 

tcccagaaccacaaaagatagtatttcgctcgcatttggttagtggaggtgttctgatca 2220 

tcacttggaggatggagctgaaagttcctatcatcatgaccaactetccatggcaaaagg 2280 

tttctagttccaagtggcaggacgatgattactgagtgactgaatggagtaactgtcatc 2340 

ttctaccactaaccatcatttattaatacataaatcatcatccggagcctaaactcagaa 2400 

aggctaatcaaaagtgcaatctttctcaaatggctgccatatgccagtggtacatgcctg 2460 

gccattgtactttttcggtgaaccatctcgtctcaagcatgagatgaaggcctgaactgc 2520 

aatgtccttgatttgatgcaaccattattagaagaaacgctaagcgatgccggtcctggc 2580 

aagggcaatgccatatcgtcagacagacagggattcggaatcgaatggctagctggtgac 2640 

aaatcgcacggggattaataaactacattggtcattgattccatcccccacacacctgca 2700 

gGCTGX^CCCCAAGGTGGTGACAATGGTGGAGCAGGACCTGAGCCAC^PCGGGXJTCCrTCC 2760 

522 LAPKVVTMVEQDLSHSGSF 

TGGCGCGCTTCGTGGAGGCCATCCACTACTACTCGGCGCTGTTCGA 2820 

541 LARFVEAIHYYSALFDSliDA 

GCTACGG<^AGGACAGCCCCGAGCGGCACGTCGTGGAGCAGCAGCTGCTGTCGCGGGAGA 2880 

561 SYGEDS PERHVVEQQLI«SRE 

TCCGCAACGTGCTGGCCGTGGGCGGGCCGGCCCGCACCGGCG^ 2940 

581 IRKVLAVGGPARTGDVKFGS 

601 WREKIiAQSGFRAAS X,AGSAA 

CGCAGGCGTCCCTGCTGCTCX3GCATGTrcCCCTCCGACGGGTACACGCTGGT^ 3060 

621 AQAStl-LGMFPSDGYTLVEE 

ACGGCGCGCTGAAGCTCGGGTGGAAGGACCTCTGCCTGCTCACC^^ 3120 

641 NGALKLGWKDIiCkLTASAWR 

CCATCCAGGTGCCGCCGTGCCGTTGAtgagacctctgcctgctcctgcttgcgttgagag 3180 

661 PIQVPPCR* 

gccgccactccacttgttttgcatctgtagctgctcggtttggtcatcagctgggagata 3240 

agaaaagcggaaacgtactaattgctctggagtagatccatccattcacagtgatagtta 3300 

ctgatgtactaagctttaattagttcaatgctagatcgttcttgttcaggtgtcgatcgc 3360 

gtatccttgtccttggtctccttttcattttggtgccttgtctagtcgctttcccgacta 3420 

atgccgtgctcttcatgcgcgttctagtgaagattcttgccgagaatattagcatagttt 3480 
tcatgtaaagtagccatcaagcaagtatta 3510 



Nucleotide and deduced amino acid sequence of the maize SCARECROW. 
Amino acid numbers are shown to the left; nucleotides are numbered 
on the right. Forward and reverse primers tested are underlined (J1050F 
and J1450R) . 



Fig. 25B 
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Zot SCR VDGIIFDIIG S33GAAVSIT QLHWVREII HPCMFGL&SU LELRLBSIiiA 193 
At SCR VDAirRDLIH SS-- TSVSIP QLJQNVRDII FPCNPNIfiaL IfXRIfiSUH. 190 



Fig. 26A 




Fig. 27 
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Z25645 

ctttgtcaat ggtaaatgag ctgaggcaga tagtttctat ccaaggagac ccttctcaga 
61 gaatcgcagc ttacatggtg gaaggtctag ctgcaagaat ggccgcttca ggaaaattca 
121 tctacagagc attgaaatgc aaagagcctc cttcggatga gaggct^bgca gctatgagat 
181 cctgtttgaa gtctgccctt gtttcaagtt cgggttttta gcagctaatg gtgcgatact 
241 tgaagcaatc aaaggtgaag aagaagttca cataatcgat ttcgatataa accaagggaa 
301 ccaatacatg acactgatac gaagcattgc tgagttngcc tgggtaaacg acctcgcctg 
361 aggttaaaca ggaattgatg accctgaatc cagtnccaac cgctccattt gggggggcct 
421 aaagaa 

Fig. Z&N 



ttaaagctat tcccggtgac gcgattctca atcagttcgc tatcgattcg 
ctaaccaagg cggcggagga gatacgtata ctacaaacaa gcggttgaaa 
gcgtcgtgga aaccactaca gcgacggctg agatcaactc ggcatgttgt 
tcgcaggaga acggtgtgcg tctcgttcac gcgcttttgg cttgcgctga 
gaaagagaat ctgactgtag cggantctgg tgaagcaaat cggattotta 
aaatcggagc gatgagaaaa gtcgctactt act 



gagtacgatc 
61 gcttcttcgt 
121 tgctcaaacg 
181 cctggttgac 
241 aagctgttca 
301 gccgtttctc 



Fig. 233 
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77/101 

Z34539 

aaatttttca afctacctaat ataatgaaag ataagatctt aacaagtgac aaagggaaaa 
61 acagtaggat ttagtttggc fctcggtcgga aatctatcat cataaccggt tcaacagatc 
121 aattcattga gccaccatct aattggtgag agtttccaag ccgaggtggc tatgagcggt 
181 cgtgtgtgcc aacccaacat gagacagccg tcactctcct ccacccgata accctcaccg 
241 ecgttgaaca gagccaaaag catactcgct tgcttaaacg cattcgaacc aatatgtgca 
301 gccgcaaacc cagcagaccc gaaccggttc ctccantgac ttcaacgttt catgacggtt 
361 caacttcggt ca 

Fig. 2&C 



Z33772 

ttttttttta agtgagaacc ttaacaaatt taaccatttg aactgaaata tgaacatgta 
61 aagactcatt cacacttagc aaataggttt agaaccaaaa ctctaattat ttttatataa 
121 tagggaaaaa aaagaaagaa aaattcttcc ataagtgtta gattagcttt tagtacctgfc 
181 gatcacccct aacctctggt aataatacat ggagatgatt taaccagtta cacaataacc 
241 caagattaca gtaaaaacat aattatgttt tatgaaacat aaagactata tgctcfctgtc 
301 acfctafcctta cctccaagct gaagcaacgg attaagcttt tctcctccca gcaaaaatgg 
361 gagctcaccc atttcttctt taaggttgta cttnttgca 



Fig. 28>V 
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Z37192 





gctatggaag 


gagagaagat ggttcatgtg attgatctcg atgcttctga 


gccagctcaa 


61 


tggcttgctt 


tgctteaagc ttfctaactct aggcctgaag gtccacctca 


tttgagaatc 


121 


actggtgttc 


atcaccagaa ggaagtgctt gaacaaatgg ctcat^gact 


cattgaggaa 


181 


gcagagaaac 


tcgatatccc gtttcagttt aafccccgttg tgagtaggtt 


agactgttta 


241 


aatgtagnac 


agtttagggt ttaaacagga gaggcnttag ccgttagctc 


ggttcttcaa 


301 


ttgcata 










Fig. 23B 





Z37191 

ccgatcatca aattagttat cttcagctca 
61 ccagaccaaa ttgaaccaac acacaaaggc 
121 gccaaaatag cataaagaga cagaaagtca 
181 atgtatacaa aaaagaaagg gactctgctc 
241 atcattcact tcctgcatct ccaagctgat 
301 attactgcgc acccgntctc ttccttgatt 



aattggattt ggtttggtat tacacocaca 
tttacatgca gaggcagtag aagcatttaa 
ccatcacaaa acaactaaga ttgtgtcccc 
ataaccaaaa tagaagacaa actgtaatat 
accgagtata gaggtcgatc ttgccagcaa 
ctatacccat caaaa 



Fig. 26F 
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Z46550 

gtggaattac aattacagca atttgtattc aattgttgaa tctaagcctg gcttcatctc 
61 tttggcctgg aacgatttac ctctcctcac tctttcttcc tggcgataac caaaccaaac 
121 cgatccggta ttcttagttt tgttttgttt tcaatgttat ttttggttag acaaatattc 
181 aattgttaat atactccgtg gtcagagtgt tttgtttttc ttttagttcg aacgttgaat 
241 taattcaggg gtaggttttg aattctctga accttatgtg ttttttggta acatcatttg 
301 gatttgtgaa ctaggtttaa aaactggtct tagtcttgtt gttttctcat tagataattt 
361 aaactggttt gcttctttat ttttgggttg ggataaaagt gaccgg 

Fig. 2&G 



Z3&04-& 

gtggaattnc aattacagca atttgtattc aattgttgaa tctaagcctg gcttcatctc 
61 tttggcctgg aacgatttac ctctcctcac tctttcttcc angcgataac caaaccaaac 
121 cgatgecggt attcttagtt ttgttttgtt ttcaatgtta tttttggtta gacaaatatt 
181 caattgttaa tatactccgt ggtcagagtg ttttgttttn cttttagttc gaacgttgaa 
241 ttaattcagg ggtaggtttt gaattctctg aacctnatgt gttttntggt aacatcattt 
301 ggatttgtga actaggttta aaaactggnc ttagtcttgt tgttttctca ttaggataat 
361 ttaaactggt ttgcttcttt attttnggtt gggataaagt gaccgg 



Fig. 2£>H 
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caaaactaca tttcatcact 
61 tatttggctfc tcaacttccfc 
121 gaaccggtca ctttatccaa 
181 gaaaacaaca agactaagac 
241 aaaaaacaca taaggttcag 
301 ctaaaagaaa aacaaaacac 
361 ccaaaaataa cattgaaaac 



Z33085 

tttttgagca aaattacaaa 
aattttatga aatagtaatt 
ccaaaaataa agaagcaaac 
cagtttttaa acctagttca 
agaattcaaa acctacccct 
tctgaccacg gagtatatta 
aaaacaaaac tanggaatac 

Fig. 201 



taaaagagta gttacaaata 
acatctcaaa cagatgacca 
cagttfcaaat tatctaatga 
caaatccaaa tgatgttacc 
ganttaattc aacgttcgaa 
acatttgatt atttgtctaa 
cggatcggt 



cccaacgggt cctgagcttc 
61 caaattcggt tatgaatctg 
121 tgtgcacatt atcgatttcc 
181 tcttggtgct agacctggtg 
241 atcatcgttt gctcgtcaag 



F13£>96 

ttacttatat gcatatcttg 
ctaatggagc tatagctgaa 
agatttctca aggfcggtcaa 
gacctccgaa cgttaggata 
gaggacttgc agttagttgc 



tatgaagcct gcccttattt 
gctgtgaaga acgaaagttt 
tgggtgagtt tgatccgtgc 
acgggaattg atgatccgag 
acaaagcact tggca 



Fig. 25J 
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F13£97 

gggtcatcaa catatcactt actactacaa catttgacaa cttgttcctn cggatcatgc 

61 atgagtttta cttttacaaa cagattctgc aaactttaaa agcaagtttc taatctcttc 

121 tgaaaccgaa caaggttttt attagttacc tccaagcaca agaagtgata agaggttgat 

<r 

181 tcttccatcc taaatacaat gctccatctc tttcttcaag tgtatacttc tctgaafcaac 
241 tetcaagcaa tcctttgatt gttgcgttca catacgagct caaaggatac ggtttaaatc 
301 ccgccatgtg aaaccgaga 

Fig. 2&K 



F13949 





caaaaattta 


tatatttgtg tgaacttaaa tttaaaaatc catcgcactg 


agcaaaataa 


61 


nntcagaaac 


taaaaatttg tcatttaaga taaattgaat taaggaaaat 


atttttttaa 


121 


taattgaaac 


tccggtggaa atcaggagga gcgacatctc catgctgaaa 


ctccgacgag 


181 


ttctgtcett 


tgccaacata ggagaagtga gtfcatgttfcc tcctcgacgt 


gaaagcctct 


241 


cactggcgtc 


cgttggntna aacactcggc ttgagactcc gtgaagttac 


tgtgcgtcac 


301 


cggtgagaaa 


cccatctgta gaaacatcgc ttgccacgtc atcatcggcc 


ttfcctatcgg 


361 


acggctacga 


tccaacacca gcttctctat ctccggotgt ataaggaaa 





F\0. 231 
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T227£2 





ctattttnac 


aatttatttt gttattagaa gtggtagtgg 


agtgaaaaaa 


caaatcctaa 


61 


gcagtcctaa 


ccgatccccg aagctaaaga ttctnc&cct 


tcccaaataa 


agcaaaacct 


121 


agatccgaca 


ttgaaggaaa aaccttttag atccatctct 


gaaaaaaacc 


aaccatgaag 


181 


agagatcatc 


atcatcatca tcatca&gat aagaagacta 


tgatgatgaa 


tgaagaagnc 


241 


gacggtaacg 


gcatggatga gcttctagct gttcttggtt 


ataaggttag 


gtcatccgaa 


301 


atggctgatg 


■tttgetcaga aactcgagca gcttgaagtt 


atgatgrtcta 


atgttcaagn 


361 


aagncggtct 


ttnfccaactt cgcnacttnn gactgttcac 


tntaatncgg 


cggnngtttt 


421 


caacgntggc 


ttgntttcna tgntnaccga ccttaat 
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atgggaaagg 


agcatttaat ctcgactcaa ttgctctacg 


agctctctcc 


ttgtttcaaa 


61 


ctcggtttcg 


aggccgcgaa tctcgccatt ntcgacgccg 


ccgataacaa 


cgacggtgga 


121 


atnatgatac 


cgcacgtaat cgatttcaat atcggagaag 


gtggacaata 


cgttaacctt 


181 


ctccntacat 


tatccacgcg ccggaatggt aaaagtnaga 


gtcagaatfcc 


tccggtggtt 


241 


aanatcaccc 


gccgtggcga acaacgttta cgggatgttt 


agtcggatga 


cgggtggnga 


301 


agagaggttt 


aaaagcccgt ncgngntttt ttttgnagcc 


actncngntn 





Fig. 2&H 
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acfccggtatc tccgtaagtt tcaacgtggrt gacgagttta cgactcggtg atctgaatcg 
61 tnaatetntc gggtgtnatc ccgacgagac tttggcfcgta aacttagctt tcaagcttta 
121 tcgtgttccc gacgaaagcg tatncacgga gaafcccaaga cgaacttctc cggcgcgtga 
181 agggacttaa accgcgcgtg gttactctag tggagoaaga aatgaattcg aatacggcgc 
241 cgtttttagg gagagtaagt nagtcatgcg cgtttnacgg tgcgttnctt gantcggtcg 
301 agtctacggt tcctagtacg gatttccgac ccgfcgccaaa atttnnggaa ggaatttgcc 
361 cgnaannttn naaaccgggt g 

Fig. 2.80 



atnaaaagtc tttttttttt ctttgfctaca taagattcct acacttttcg aaatggaaaa 
61 tcacaatgat aataatatca gaataatctc gaaaattaat aataatatgg taataataag 
121 aagaaaaaaa aagagtgtgt gaagttaacg ccaagcggat gcgacagtga gtgcccgtcc 
181 catcca&cca aagcacacac ctccgttatc ttctttaacg gtaaagcccg ggfcggacfccg 
241 gtttccacga cfccttcatcg actccgctat cttctcactc aatggcatta actcaaaccc 
301 agcoatgctc atccgcattc gccatttncc ggaacanctc gnaccgctct atacgntcga 
361 ttccttcgga cggcaccgng ttttactagc ttccggncaa ttccttcctn aactttggaa 
421 cggtnggatt cgttcttggg accgtaggct tggcccgctt aagaacgnac cgtacagggg 
481 nntgttttnt taatttccct , taaaaggggg cgnttttggg ttnatttttn ana 
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T43670 

caaccntttt atagtcaagc agctctcaac gctttttttt caaggtctgt naagcctcga 
61 aattatcaga ntttncaatc fcccgtcgccg atgafctganc tcacgtcggt gaatgatatg 
121 agtttntttg gnggttctgg ttcatctcag cnttacggtt taccggttcc caggtctcan 
181 acgcaacagc aacaatcgga ttacggttta tttggtggga tccgaatggg aatcgggtcg 
241 ggtattaata attatccaac attaaccggc gttccgtgta ttgaaccggt tcaaaaccgg 
301 gttcatgaat cggaggacca ttgttganta agntfcaagag agctttgtng aaacaanctt 
361 tttangattg atnaccg 



Fig. 2&Q 



T70186 

tgcatacaac gcaccgtttt tcgtaacacg gtttcgcgaa gtctatttca tttctcctcg 
61 atttttgaca tgcttgagac aattgtgcca cgagaagacg aagagaggat gttccttgag 
121 atggaggtct ttgggagaga ggcactgaat gtaattgctt gcnaaggttg ggaaagagtg 
181 gagaggcctg agacatacaa gcagtggcac gtacgggcta tgaggtcagg gttggtgcag 
241 gttccatttg acccaagcat tatgaagaca tcgctgcata aggtccacac attctaccac 
301 aaggattttg tgatcggtca aagataaccg ggtggctctt tcaaggntgg aaggggaagg 
361 anctgtcatg ggtctttctt ttttggaaac cagagtccca aggttttncc ggaaaatcct 
421 ccttgggnat ttnangnccc tttttttgtt tttttncccn gnnanttccc nggggnagtt 
481 tccagtttna ggngngtttt tncnaaaa 



Fig. 2&K 
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tgcatacaac 


gcaccgtttt tngtaacacg gtttcgcgaa 


gtctatttna 


tttctcefceg 


61 


atttttgaca 


tgcttganac aattgtncca cgagaagacg 


aagagaggat 


gttectfcgan 






ttgggagana ggcactgaat gtaattnctt 


gcnaaggttg 


ggaaagagtg 


181 


gagaggcctg 


anacatacaa gcagtggcac gtacgggcfca 


tgaggtcagg 


gttggtgcag 


241 


gttccatttg 


acccaagcat tatgaagaca tcgctgcata 


aggtccacac 


atfcctaccac 


301 


aagggttttt 


tgafcccnfccc aagataaccg gtggctcttn 


caaagctttg 


aagggaagga 




ccfet:t:cat:gg 








421 


tggaattttg 


nrmccccfctt tgattttttt fcccccggnaa 


ttticcc 
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gagacggtag 


atccgncgcg ctaaagcttc ggcgaagtaa 


gtagccactt 


tnntnatagc 


61 


tccggcttga 


nacacagcta agcatccnat ttgcttcaca 


agagcttccg 


ctagagtcaa 


121 


attgtnctnc 


tggattgctt ctgcacaagc cataagcgcg 


tggactaaac 


gaacaccgtt 


181 


ctctfcgcgag 


tnaaccagga taacagaacg anttgactca 


gccgccgccg 


tcgttgtcgt 


241 


ggtggttgtc 


gtcaccgtcg ttcctatgac tccaccaatn 


tgggtacccg 


tcgaagtcga 


301 


tgtaaccata 


ggatcagggc ttcgngcatg nttttaaaac 


gg 





Fig. 22>T 
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gtttgattcg ttggaaggag ttccgaatag tcaagacaaa gtcatntctg aagtttactt 
61 agggaaacag atttgtaatc nggtggcfctg tnaagntcct gacagagtcg agagacacga 
121 aacgttgagt caatngggaa aecggtttgg ttcgtccggt ttagcgccgg cacatcttgg 
181 gtctaacgcg tttaagcaag cnagtatnct tttntntgtn tttaatagtg gccaaggtta 
241 tcgtgtggag gagagtaatg gatgtttgat gttgggttgg cacactnngc ccactcattt 
301 accacctccg gttttggaaa c 



taaaaattga tcccaaaaag gcataaatta aaaatgacct accaaaacga tatatataag 
61 aattttaaac aagtgaacga aaataaataa aataaacaaa aggcaaaacg gttcgattca 
121 gttcggttta ggtcttggtc cgaacatatg tcatcaccgg tccactgatc tcaatctcaa 
181 attcactcgn ctcgactcca ccaccgtcgt atgcttcgag tcaaactcag tacgncgccg 
241 tcgagagttt ccaagcggag gtggtaatga gtggacgagt gtgccaaccc ancatcaaac 
301 atccattact ttcctccaca cgntaacctt ggccactatt taaacacagg caaaangcat 
361 acttgtttgc fctaaaccgcg ttagnccnaa gntttgccgg gcgntaaaec cggcngaccc 
421 aanccggntt tcccnatttg ctcaaacggt ttngtgnctt ttggcfcfctfct gnatggcctt 
481 taaangnncc 
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aaaaaatggg 


aaaccatcac tcttgatgaa cttatgatca 


atccaggaga 


gacaacggtc 




gfccaacngca 


tfccatcggtfc acaa£acacn cctgatigaaa 


ctgtgtcatt 


agactctcca 


121 


agagacacgg 


ttctgaagct attcagagat atcaatcctg 


acctctttgt 


gtttgcagag 


181 


attaacggaa 


tgtacaactc tcctttcttc atgacgaggt 


tccgagaagc 


gcttttncat 


241 


tacncttcac 


tctttgacafc gtttgacacc acaatacacg 


gagaggatga 


gtacaaaaac 


301 


aggtcactgt 


ttggagagag agttacttfct gaganacgcg 


nttgagcgtg 


attttcctgc 


361 


nngggnttca 


nancgggttt tnngggcctt aaaacctnca 


agaaatnggn 


ggtttgggtt 


421 


tt 
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aatcaatgtt 


ttggttatat ttcattacta geaacccacc 


cacaaccaca 


tgacaatfcta 


61 


caagagaaaa 


acaaccacca ggtttggttt gtatacatat 


ataacttagg 


ttgtgttaca 


121 


acttaaaaca 


tcattgcaca tcctaaaaat ttcagcgacc 


agaatgtgtt 


tttgattgtg 


181 


cctctttctt 


tatccacctc aagtaaccat cattcactat 




atct 



Fig. 23X 



WO 00/53723 



PCT/USOO/05875 



88/101 



N37425 



gcgaatgttg agatcttgga agcaatagct ggggaaacca gagtccacat tatcgatfctt 
61 aagattgcac agggatcaca afcacatgtfct ttaattcagg agcttgcgaa acgccctggt 
121 gggccgccgt tgctgcgfcgt nacgggtgtg gatgattcan agtccaccta tgctcgtggg 
181 ggaggactca gottggtagg tgagaggctt gcaactttgg cgcagtcatg tggtgtcccg 
241 tttnagtttc acgafcgccat catgtctggg tgcaaggtgc agcgggaaca fcctcgggfcfcg 
301 gaacctggct ttgctgttgt tgtgaacttc ccatatgtat tacaccacat gccagacgag 
361 agcgtaagtt tttgaaaatc acagngacag gcttctgcat ctnatcaana gcctttcccc 
421 aaactggtac tctagtaggc aagattcaac acaacacttg catena 



atgnaacata tagcaaaaga teatgeaatg agfcactatat ctcttaggct acactcttac 
61 acaegctatg tcacaagcat aatataacaa cattctagtg ttcaagaacc ctaactctga 
121 acttaatcca ctcgtgttgg cgagagacta tcaacagaaa agccctacat aaatcccagt 
181 cgcttagaac gtaaganaca acatctatga agacgaagga acccatagag atgaagcata 
241 cacgattcta cctttccacc cttgaagtaa ccagttaccg ttttgatcaa catcgaagtt 
301 tttategtae ccgttttcgg attttcaact tcagattctg catcagttcc ttctcaagcg 
361 gnagctgtcc taaatccggg tegggtcagt ctcggctggc actggttata tggctctggg 
421 ctctccactc tctctggtct tcacaaggca cancattcac aatctntttt ccataaaact 
481 nntttfccntn catnngnenn atnttggctt ccctnggntg gttggggnnc ncnt 
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tcaaggttct tctttgtcat cttgttgccg aatccacaaa gaggagaata aagattcgac 
61 ctttattaga tattaacgac tctggatttt tgggtttttg gagttggatc cacatgggtt 
121 cttatccgga tggattccct ggatccatgg acgagttgga tttcaataag gactttgatt 
181 tgccfcccctc ctcaaaccaa accttaggtt tagctaatgg gttctattta gatgacttag 
241 atttctcafcc cttggatcct ccagaggcat. atccctccca gaacaacanc aacaacatca 
301 tcaacaacaa agctgtagca ggagatetgt tatcatcttc aactgaatga cgntggattc 
361 tctgattctg ttttgagtat ataagccaag ttctnatggg agnnggtnat gnagagaagc 
421 ctttgtatgt tcatgnngnt ttggtnatta agntgctngg aaannactcn ntnngc 

Fig. 25AA 



5CL1 

LSMVNELRQI VSIQGDPSQR IAAYMVEGLA ARMAASGKFI YRALKCKEPP 
SDERIAAMQV LFEVCPCFKF GFLAANGAIL EAIKGEEEVH IIDFDINQGN 
QYMTLIRSIA ELPGKRPRLR LTGIDDPESV QRSIGGLRII GLRLEQLAED 
NGVSFKFKAM PSKTSIVSPS TLGCKPGETL IVNFAFQLHH MPDESVTTVN 
QRDELLHMVK SLNPKLVTW EQDVNTNTSP FFPRFIEAYE YYSAVFESLD 
MTLPRESQER MNVERQCLAR DIVNIVACEG EERIERYEAA GKWRARMMMA 
GFNPKPMSAK VTNNIQNLIK QQYCNKYKLK EEMGELHFCW EEKSLIVASA 
WR* 



Fig. 23AD 
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SCL 3 

AMEGEKMVHV IDLDASEPAQ WIALLQAFNS RPEGPPHLRI TGVHHQKEVL 
EQMAHRLI EE AEKLDIPFQF NPWSRLDCL NVEQLRVKTG EALAVSSVLQ 
LHTFLASDDD LMRKNCALRF HNNPSGVDLQ RVLMMSHGSA AEARENDMSN 
NNGYSPSGDS ASSLPLPSSG RTDSFLNAIW GLSPKVMWT EQDSDHNGST 
LMERLLESLY TYAALFDCLE TKVPRTSQDR I KVEKMLFGE EIKNIISCEG 
FERRERHEKL EKWSQRIDLA GFGNVPLSYY AMLQARRLTjQ GCGFDGYRIK 
EESGCAVICW QDRPLYSVSA WRCRK* 

Fig. 2&AC 



SCL 5 

GTSPTGPELL TYMHILYEAC PYFKFGYESA NGAIAEAVKN ESFVHIIDFQ 
ISQGGQWVSL IRALGARPGG PPNVRITGID DPRSSFARQG GLELVGQRLG 
KLAEMCGVPF EFHGAALFCT EVEIEKLGVR NGEALAVNFP LVLHHMPDES 
VTVENHRDRL LRLVKHLSPN WTLVEQEAN TNTAPFLPRF V ETMHH YLAV 
FESIDVKLAR DHKERINVEQ HCLAREVENL IACEGVEREE RHEPLGKWRS 
RFHMAGFKPY PLSSYVNATI KGLLESYSEK YTLEERDGAIi YLGWKNQPLI 
TSCAWR* 



Fig. 23AD 
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ecie 

AAIFYGHHHH TPPPAKRLNP GPVGITEQIiV KAAEVIESDT CLAQGILARL 
NQQLSSPVGK PLERAAFYFK EALNNLLHNV SQTLlNPYSLI FKIAAYKSFS 
EISPVLQFAN FTSNQALLES FHGFHRLHri DFDIGYGGQW ASLMQELVLR 
DNAAPLSLKI TVFASPANHD QLELGFTQDN LKHFASEINI SLDIQVLSLD 
LLGSISWPNS SEKEAVAVHI SAASFSHLPL VLRFVKHLSP TIIVCSDRGC 
ERTDLPFSQQ LAHSLHSHTA LFESLDAVNA NLDAMQKIER FLIQPEIEKL 
VLDRSRPIER PMMTWQAMFL QMGFSPVTHS NFTESQAECL VQRTPVRGFH 
VEKKHNSLLL CWQRTELVGV SAWRCRSS* 

Fig. 2£AE 



5CL11 

LMINPGETTV VNCIHRLQYT PDETVSLDSP RDTVLKLFRD 
IMGMYNSPFF MTRFREALFH YSSLFDMFDT TIHCERRDEV 
RPETYKQWRV RILRAGFKPA TISKQIMKEA KEIVRKRYHR 
MLQGWKGRVI YAFSCWKPAE KFTNNNLNI* 



KKWETITLDE 
INPDLFVFAE 
I SCEGAERFA 
DFVIDSDNNW 



Fig. 2S>AF 
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SCL 13 



ANVEILEAIA GETRVHIIDF QIAQGSQYMF LIQELAKRPG GPPLLRVTGV 
DDSQSTYARG GGLSLVGERL ATLAQSCGVP FEFHDAIMSG CKVQREHLGL 
EPGFAWVNF PYVLHHKPDE SVSVEKYRDR LLHLIKSLSP KLVTLVEQES 
NTNTSPLVSR FVETLDYYTA MFESIDAARP RDDKQRISAE QHCVARDIVN 
HIACEESERV ERHEVLGKWR VRMMMAGFTG WPVSTSAAFA AS EKLKAYDK 
NYKLGGHEGA LYLFWKRRPM ATCSVWKPNP NYIG* 



Fig. 2£A6 
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SCL 14 

LLKVIiLCHLV AESTKRRIKI RPLLDINDSG FLGFWSWIHM GSYPDGFPGS 
MDELDFNKDF DLPPSSNQTL GLANGFYLDD LDFSSLDPPE AYPSQNNHNN 
NINNKAVAGD LLSSSSDDAD FSDSVLKYIS QVLMEEDMEE KPCMFHDALA 
LQAAEKSLYE ALGEKDPSSS SASSVDHPER LASHSPDGSC SGGAFSDYAS 
TTTTTSSDSH WSVDGLENRP SWLHTPMPSN FVFQSTSRSN SVTGGGGGGN 
SAVYGSGFGD DLVSNMFKDD ELAMQ FKKGV EEASKFLPKS SQLFIDVDSY 
I PMNSGSKEN GSEVFVKTEK KDETEHHHHH S YAP PPNRLT GKKSHWRDED 
EDFVEERSNK QSAVYVEESE LSEMFDNMFL CGPGKPVCIL NQNFPTESAK 
WTAQSNGAK IRGKKSTSTS HSNDSKKETA DLRTLLVLCA QAVSVDDRRT 
ANVXLRQIRE HSSPLGNGSE RLAHYFANSL EARLAGTGTQ IYTALSSKKT 
SAADKLKAYQ TYMSVCPFKK AAIIFANHSM MRFTANANTI HIIDFGISYG 
FQWPALIHRL SLSRPGGSPK IiRITGIELPQ RGFRPAEEFR RQVIAWLDTV 
SDTMFRLSTT QLLRNGETIQ VEDLKLRQGE YVWNSLFRF RNLLDETVLV 
NSPRDAVLKL XRKINPNVFI PAILSGNYNA PFFVTRFREA LFHYSAVFDM 
CDSKLAREDE MRLKYVFEFY GREIVNWAS EGTERVESRE TYKQWQARLI 
RAGFRQIiPLE KELMQNLKLK IENGYDKNFD VDQNGNWLLQ GWKGRIVYAS 
SLWVPSSS* 



Fig. 2£>AH 
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| LEUCINE KEPT AD I 1 

I - * 1 ! B 1 

SCL) EVVDlilSl-LIKCXQAVXMXmRCXCQU^^ ?.SAAWu?A 

SCU.4 ETXDUmiVI^CAVSVnnmaiWXLROIREHSSPL-GfCSESLXH^ TSAAEMLZA 

SCL1 IjSMVNEI^QtVSIQ^DPSQRIXAYKVEniAWK?^ASGKFIYRAIiCC<E ?PSEE?iAA 

SCL8 TSVCSRQTVMEIATAIAEGKTEIXTEI^^ GKE HLI3 

SCL4 FDLH?PIXXATfDC\RISDSDPNEASKTI^IRES\reE^ TSSSSSSTEELtLS 

SCLfi GPVGITEQLVKAAE-VXESDraJU^IIJVRU^^^ LNPYSLIFXIAA 

SCX15 SSFCTIHELIiWVDCVESDELQLAQWI^^ ?IRLSSWSEr*WKA 

GAI NGVHI-VHAIiUCAEAVCKnttlVAEALV^ QSPIEHCLSOTL 

ROA NGVI^VHAIilACAEAIQQNNLTIAEALTOQICCTlA^ QNQICHSISDTL 

RGAL TOV^VKMJ^ACAEAVQQNNUOiAI^VCTTC DOVXSSSFSUTL 

SCR KimLTUWX^^VSADNLEEANKIJXEISQ^ 



| VHUD 1 

SOJ AMB3EKK VHVIELnASEPAWLAIXQAFNSRPEG PPHCJOTGVEHQ 

Sd.9 HQLFLACCPFRKLSYFITNKTIRDLVGNSQR VHVIBFGILYGFQWPTLIHRFSMyG SPKVJOTCIZFPOPGni 

SOJ.4 YQTYMSVCPFKKAAI IFANHSMMRFTANAWT IHIIEFCISYGFCWPALIKKI^LSRPGG SPFXRHOIELPQRGFR 

SCU6 LAEFVDLTPWHRFGFIAANAAILDAVB3YSS VHIVD£^THCM2rPTTJI^S£AWCLHJOCP PPIiKMVIASEAEFHP 

SCL13 AHVEILEAIAGETR VHIIDPQ1AQGSQYMFLIQELAKRPGG PPUJWTGVDOSQSRYA 

SCLS WHILYEACTYIXTCTESANSAIAEAVKKESF VHID^rSQGGQWVSI.ISlALGARPOG PPNVRZTCIDDPRSSFA 

SCXJ. MQVIJEVCSCFKFGFLAAM3AILEAIKGSEE VHItDFDINC3GNQVWn.IRSIAELPGK RPMJU.TGIDDPESVQR 

SCL8 TQIXVE!^R3TaX3FEAAraAIIJ3AAI»OnXXMCCPHV^ 

SCX4 mX»03ACPYSKFAHLTAM3AJXEATE3CSMK IHIVDFGIV03IQWPALLQALATRTSGK PTQUtVSGIPAPSLG— 

SCL6 YKSFSEISPVLQFANFTStCALLESFHCFHR UCTIDFDIGYGG0WASLM3ELVLRENAA PLSXJOTVFASPA 

SCL15 IKJEX5GISyiPI^SHFTAM3AIIJJSI^SQSSSPF^^ GGFLKVTAWA 

SCU.8 W*n^LTPFT3UFGHLTAM3AILI»TETN^^ 

GAI CMKFYETCPYLKFAHFTAI75AXLEAJFCGKKR VBVII)FSMSQGLQWPAIM3ALALRPGG PPVTOI.TCIGPPA 

RGA QMKFYETCTVtKFAHFTA«3AILEAFB3KKR VHVIEFSMNQGLQWALMQALALREGG PPTTRLTCICPPA 

SGAL QrKFVESCPYLKFAHFTAM3AILEVFATAEK VHVTDt/nJMGLOWPALIQALALRPNG PPDTRLTGI 

SCR FQVF^ISTLVKFSHFTANQAIQEAFEKBOS VOTimJIMQGMWIWTnXASRPGG PPHVRI/TOLGTSM 

I tEOCZHE HEPXAO II 1 | 

, * 1 , b 1 

sco evwamahrlisahocippqfnpwsrijxxj^ tgeaiav 9 dsflni 

SCLI1 K^-TITL3EIimjPC3ZrTVVNCIimC^ 

SOS PAQRVEETGQR-IAAYAKL-FCVPFEVKAIAKXWQA IQLCTU3n3PI3EITVVHCX.YBAENI4iDESVKVESCFXrri/U^ 

SCX14 PAEEFRRQVTA-WLI7rVSI7rM--FTll>ST^ 

SCL16 PPIXG I SYEELG SXL VNF ATTRNVXKE2TII ISSSYSCGLSSL rSQLRID PFVFtJEAiWNCHhMJlYI PJ2EJXTSN - LRSVFLKE 

SCL13 R GGG I^VGEMJmjU2SCGVPFEFra^-MSGCX--VOR£mjGl£P^ 

SCLS R QOT---LEI.VC^PXGRIA£t*DCWPFEr^ 

SCU S IGG UUXCLRLS31AEIX1GVSFKFKAMP-SK^ 

SCL3 VD — OKEEPXXAWDLJL^QIJSDRIXUSVSFN^^ 

SCX4 ESPEPSIJJvTGNRIJffiFAKVLOLNFDFIPrLTPIKL UCSSFRVDPDEVIAVNFMLQLYKLLEEI PTIVETAL-R 

SCLfi WlD3r,FT^FT^DKUggASEPreSIJI(?^IJ?I^SISWPNSS EKEAVAVHISAA S FSHLPLVLRFVKH 

SOJ.S EECAVEHILVKENLTQFAAEMKIRFQIEFVIJWKTFEKLSFKAXR-— FVEGERTWLtlSPA 1 FRHL3GITDFVNN 

SCI 8 NRTGDRLTRFAX^SI^LQFQFHTLVIVEEDLAGLLLQ IRIXM^AVQCTTXAVNCWlFLKKt FNDCGCKIGHFL- 

GAI P0NFTTY^HEVGCKIAHIAEA13WES^^ CGRPGAIDKVLGV 

SGA PDNSimHEVCXnaJVQrAEAIHVH?^^ LGRPGGIEXVLGV 

HGAL GYSLTDIQCVGWCtvGQrASTIGVNFEFKS lALNOT^DLKPEKLDIRPGLE - SVAVNSVFELHRL LAHPGS IDKFLST 

SCL19 POPVQSNKLLNT 

SCR EA LQATGXM^FTESCI^PFITCPL^HCVCWDLT ERLNVRKREAVAVH- -WLOHSL YDVTGSDAHTLWL 



Fig. 29A 
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PFXRE 1 . i 

SCLj -«GL3PrVKV/TSQCS cmjSTLMEHIJ^ESLVTYAAUTCI^T^^ -5CEC51S 

SCLil FSDEIPDLrVFAEING ^f^ISP5TlfI^^^^EALFHYSSIJ^>^FDTTIHaEDEy)Og^ST 7 K?~,T VRCMiSVI SCZGAZR 

SC9 IGKZOTDLFVFCIVN3 AY1^FFVTRTRE^lLFKFSS:LFTMI^ ACSGWER 

IKKIOTNVFIPAaLSG NYXAPFrVTRFMXI^YSAVFmn>SXIA^ ASEGTSR 

SC16 LRDLNPT IVTL rDEDSDFTSTN ,WV AK3GAER 

SCL13 IKSLSPKLVTLVEQES OTT^rePLVSRFVETLDYYTAMFESIQAARPRCCKvRIS — AJSQHCVARDIVNMI ACEESER 

SCLS VKHLSFNWTLVECEA JnT^APFIiPRFVEraNHYIAVFESIEVKZJUUSHKERIN — VEQHCLAKEWNLI ACEC V EK 

SCL1 VKSLOTKLVTWEQDV NINTSPrFPRFIEA3fEYYSAVFESU*fIXPRESQEJ^ ACSGEER 

SCL3 VKGLXPRWTLVEQEM NSOTAPrLCKVSESCAOTOALLESVESTVPSTOSCBAK- -VEEG- IGRKLVNAV ACEGIER 

SCL4 IJUOOTKWTLCEYEV SLNRVCTAMWKlttI«FTrSAVreSLEPNIX^^ TCIHR 

SCL7 SLEPNIZKDS3CERLR — VI3WIJGRRIHDLTOSDDDNNKPGTR 

SCLS L — -SPTIIVCSERGC ERTDU'FSCXLAHSIJISOTALFESLilA--VNANLDAM2K-- IERFLIQPEIEKLV LDR 

SCL15 U*KVSPKWVFVDSEGV^IAGSGSFREEFVSAIJE^^ ETAA-DR 

SCL18 -SAIRSU^RIVTMAEREANHGDHSTT^O^ 

GAI VNQOTEIFIWEQES KHNSPITIi»FreSUIYTSTLFBSI3W--PSGQa^ ACDGPDR 

RGA VKQnCWITTVVEQES NHNSPVJTJSRFTESLHlfXSTXFDSLBGV — PNSQOKVM — SEVYIj-GKQICHLV ACSGPDR 

RGAL IKS IRPDIKTWEQEA NKNSTVTXjDRFTESLKTCSSIJFDSLEGP P SQCKVM — SELFL-GRQILNt.V ACEGEER 

SCL19 VKATKPSIVTWEOEA NHTCIVFl^RFKEAIilYrSSLFDSI^SYSI^ AAEGSER 

SCR LQRLAPKWTWBQO L^K^SFLGRFVEAJJTSncSALFDSlJSASVGEESEERHV' — VBQQLLSKEXRHVL, AVGGPSR 



SCL3 REJUtEK-LEKWSQRIDLACFSJVPLSYY^ 

SCL11 FARPrr-YKCWWRIIJ*ACTTa>ATISKQIMKEAKEir^ 

SCL9 VE^PET- YKOWHVKAMRSGL.VCVPFDPS IMKTSLHKVHT- FYHKDFVI - DQCNPWLLQGWKGRTVMALSVWKPES 

SCL14 VESRrr-YKQWQABLIRAOFliaiJ'LEKEl>J3NIJ5^^ 

SCL16 VERIEP FTGVGFGETAKlliVKIMEXEHATGWtQQ<WlXXX^DVERF^ 

SCX.13 VERHEV-LCSCWRVFMMMAGFTGWPVSTSAAFAASEMLiKA — YDKNYKL-CGHEGALYI^VKRRPMATCSVWKPNPMYTG 

SCLS EEJUil^-LGKWRSHFHHAX^TCPYPLSSYWATDCGLLES — YSEKYTL-EERDGAI*YljGWKM2PLITSCA>fR 

SCL1 TFPwa - tmii a 3MV>ral^Fm>^rPM<;AKVTO^^TOW■TTr>^y<--^^^v^a .- rCTrM^rT-Hwr-Mgirr^ 71^7 p^p 

SCX3 IZRCEV-FgKWRMBMSKAGFELMPI^EKIAESMKSR-<^^ 

SCW ERMEE--KEQ*WVU1ENAGFESVKI,SNYAVSQA^ 

SCX6 SRP XE^PHrcTWDAMFlJQMGF S PVTHSNFTESQ AEjCLVQR- T P7PGFH~VEKF3INSIXIjC><>RTELVGVSAKRCRSS 

SCL1S RKTCE KTWREAFCAA£MRP IQO^QFAI^^AEClXjEK-AQVFGFH-VAf^Q^LVLCVSiQ^ALVATSAHPF 

SdJ.8 HRRFE WEEKMKJUFXIFWVPIGSFAI^AKIXI^ 

GAI VERHCT-L^3WP*NRFGSA^AAAHIGSNAFKQASKC^-ALFN3C»^^ 

XJA VEREET-LSQK»RFCSS<n^AHI/35NAFKQASaZ^^ 

RGAL VERHET - LNQWRNRFGIjGGFKPVS XGSKAYKQASMIXALYAGAIX7YN- VEEKEGCIZXGfQTRPLIATSAHRIIJRyE 

SCL19 VERHET-AAQWRIRKXSAGFDPIHL^SSAFKQASKLI^^ 

SCR - - SGEVKTZSWREKMQQCGFKG ISLAGMAATQATLLLGKF P - SDGYTT-VDDM -GTLKLGWKDLSLLTASAWTPRS 
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CBPBT44 partial cDNA sequence 

GCGGCCGCGCAGAGCCGCCGCGTGGCGGTGGCGTTCCAGGCGTACAACGCGCTGTCGCCG 
CTCGTCAAGTTCTCGCACTTCACGGCCAACCAGGCCATCCTGCAGGCGCTCGACGGCGAG 
GACTGCCTCCACGTGATCGACCTGGACATCATGCAGGGCCTGCAGTGGCCGGGGCTCTTC 
CACATCCTCGCGTCCCGCCCGCGCAAGCCGCGGTCGCTCCGGATCACCGGGCTCGGCGCG 
TCGCTCGACGTCCTCGAGGCCACTGGCCGCCGCCTCGCCGACTTCGCGGCCTCGCTCGGC 
CTCCCGTTCGAGTTCCGCCCCATCGAGGGGAAGATCGGGCACGTCGCCGACGCCGCGGCG 
CTCCTCGGCTCGCGCCAGCGGCGGCGGGATGACGAGGCCACCGTGGTGCACTGGATGCAC 
CACTGCCTCTATGACGTGACGGGGTCGGACGTGGGCACGGTGCGGCTGCTCCGGAGCCTG 
CGCCCGAAGCTGATCACCATCGTGGAGCAGGACCTGGGCCACAGCGGQGATTTCCTGGGC 
CGGTTCGTGGAGGCGCTGCACTACTACTCGGCGCTGTTCGACGCGCTGGGAGACGGCGCC 
GGCGCGGCCGAGGAGGAGTCGGCCGAGCGGTACGCGGTTGAGCGACAGCTCCTGGGCGCG 
GAGATACGCAACATCGTGGCCGTAGGGGGGCCCAAGCGGACAGGGGAGGTGCGCGTGGAG 
CGGTGGAGCCACGAACTGCGGCACGCCGGGTTCCGGCCAGTGTCCCTGGCCGGGAGCCCT 
GCCGCGCAGGCCAGGCTGCTCCTCGGCATGTATCCGTGGAAGGGGTACACGCTGGTGGAG 
GAGGACGCGTGCCTTAAGCTGGGCTGGAAGGACCTCTCCCTGCTCACCGCGTCGGCGTGG 
GAGCCGGCGGACGACGCTGCCGCTTCTGCGCCCACCGGTTAACGAGTACGAGCGGACGCG 
TGGGTCGAC 



CBPBT44 partial amino acid sequence 



AAAQSRRVAVAFQAYNALSPLVKFSHFTANQAILQALDGEDCLHVIDLDIMQGLQWPGLF 
HILASRPRKPRSLRITGLGASLDVLEATGRRLADFAASLGLPFEFRPIEGKIGHVADAAA 
LLGSRQRRRDDEATWHWMHHCLYDVTGSDVGTVRLLRSLRPKLITIVEQDLGHSGDFLG 
RFVEALHYYSALFDALGDGAGAAEEESAERYAVERQLLGAEIRNIVAVGGPKRTGEVRVE 
RWSHELRHAGFRPVSLAGSPAAQARLLLGMYPWKGYTLVEEDACLKLGWKDLSLLTASAW 
EPADDAAASAPTGXRVRADAWVD 
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Zm SCR GRVAAAFQVF NGISPFVKFS 

CBPBT44 RRVAVAFQAY NALSPLVKFS 

At SCR LKMVSAFQVF NGISPLVKFS 



Zm SCR HFTANQAIQE AFEREERVHI IDLDIMQGLQ WPGLFHILAS RPGGPPRVRL 

CBPBT44 HFTANQAILQ ALDGEDCLHV IDLDIMQGLQ WPGLFHILAS RPRKPRSLRI 

At SCR HFTANQAIQE AFEKEDSVHI IDLDIMQGLQ WPGLFHILAS RPGGPPHVKL 

Zm SCR TGLGASMEAI. EATGKRLSDF ADTLGLPFEF CAVAEKAGNV DPEKLGVTRR 

CBPBT44 TGLGASLDVL EATGRRLADF AASLGLPFEF RPIEGKIGHV ADAAALLGSR 

At SCR TGLGTSMEAIj QATGKRLSDF TDKLGLPFEF CPLAEKVGNL DTERLNVRKR 

Zm SCR EAVA VHWLHHSLYD VTGSDSNTLW LIQRLAPKW TMVEQDLSHS 

CBPBT44 QRRRDDEATV VHWMHHCLYD VTGSDVGTVR LLRSLRPKLI TIVEQDLGHS 

At SCR EAVA VHWLQHSLYD VTGSDAHTLW LLQRLAPKW TWEQDLSHA 

Zm SCR GSFLARFVEA IHYYSALFDS LDASYGEDSP ERHV VEQ QLLSREIRNV 

CBPBT44 GDFLGRFVEA LHYYSALFDA LGDGAGAAEE ESAERYAVER QLLGAEIRNI 

At SCR GSFLGRFVEA IHYYSALFDS LGASYGEESE ERHV VEQ QLLSKEIRNV 

Zm SCR LAVGGPARTG DVKFGSWREK LAQSGFRAAS LAGSAAAQAS LLLGMFPSDG 

CBPBT44 VAVGGPKRTG EVRVERWSHE LRHAGFRPVS LAGSPAAQAR LLLGMYPWKG 

At SCR LAVGGPSRSG EVKFESWREK MQQCGFKGIS LAGNAATQAT LLLGMFPSDG 

Zm SCR YTLVEENGAL KLGWKDLCLL TASAWRPIQV PPCR 

CBPBT44 YTLVEEDACL KLGWKDLSLL TASAWEPADD AAASAPTG 

At SCR YTLVDDNGTL KLGWKDLSLL TASAWTPRS 
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